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TITLE OF THE INVENTION 

SYNTHETIC GENE ENCODING HUMAN CARCINOEMBRYONIC ANTIGEN AND USES 
THEREOF 

5 FIELD OF THE INVENTION 

The present invention relates generally to the therapy of cancer. More specifically, the 
present invention relates to synthetic polynucleotides encoding the human tumor associated polypeptide 
carcinoembryonic antigen, herein designated hCEAopt, wherein the polynucleotides are codon-optimized 
for expression in a human cellular environment. The present invention also provides recombinant vectors 

10 and hosts comprising said synthetic polynucleotides. This invention also relates to adenoviral vector and 
plasmid constructs carrying hCEAopt and to their use in vaccines and pharmaceutical compositions for 
preventing and treating cancer. 

BACKGROUND OF THE INVENTION 

15 The immunoglobulin superfamily (IgSF) consists of numerous genes that code for 

proteins with diverse functions, one of which is intercellular adhesion. IgSF proteins contain at least one 
Ig-related domain that is important for maintaining proper intennolecular binding interactions. Because 
such interactions are necessary to the diverse biological functions of the IgSF members, disruption or 
aberrant expression of many IgSF adhesion molecules has been correlated with many human diseases. 

20 The carcinoembryonic antigen (CEA) belongs to a subfamily of the Ig superfamily 

consisting of cell surface glycoproteins. Members of the CEA subfamily are known as CEA-related cell 
adhesion molecules (CEACAMs). In recent scientific literature, the CEA gene has been renamed 
CEACAM5, although the nomenclature for the protein remains CEA. Functionally, CEACAMs have 
been shown to act as both homotypic and heterotypic intercellular adhesion molecules (Benchimol et al., 

25 Cell 57: 327-334 (1989)). In addition to cell adhesion, CEA inhibits cell death resulting from detachment 
of cells from the extracellular matrix and can contribute to cellular transformation associated with certain 
proto-oncogenes such as Bell and C-Myc (see Berinstein, J. Clin Oncol 20(8): 2197-2207 (2002)). 

Normal expression of CEA has been detected during fetal development and in adult 
colonic mucosa. GEA overexpression was first detected in human colon tumors over thirty years ago 

30 (Gold and Freedman, /. Exp. Med. 121:439-462 (1965)) and has since been found in nearly all colorectal 
tumors. Additionally, CEA overexpression is detectable in a high percentage of adenocarcinomas of the 
pancreas, breast and lung. Because of the prevalence of CEA expression in these tumor types, CEA is 
widely used clinically in the management and prognosis of these cancers. 

Sequences coding for human CEA have been cloned and characterized (U.S. Patent No. 

35 5,274,087; U.S. Patent No 5,571,710; and U.S. Patent No 5,843,761. See also Beauchemin et al., Mol 
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Cell. Biol 7:3221-3230 (1987); Zimmerman et al., Proc. Nail Acad Set USA 84:920-924 (1987); 

Thompson et al. Proc. Natl Acad. Set. USA 84(9):2965-69 (1987)). 

The correlation between CEA expression and metastatic growth has led to its 

identification as a target for molecular and immunological intervention for colorectal cancer treatment 
5 One therapeutic approach targeting CEA is the use of anti-CEA antibodies (see Chester et al., Cancer 

Chemother. Pharmacol 46 (Suppl): S8-S12 (2000)), while another is to activate the immune system to 

attack CEA-expressing tumors using CEA-based vaccines (for review, see Berinstein, supra). 

The development and commercialization of many vaccines have been hindered by 

difficulties associated with obtaining high expression levels of exogenous genes in successfully 
10 transformed host organisms. Therefore, despite the identification of the wild-type nucleotide sequences 

encoding CEA proteins described above, it would be highly desirable to develop a readily renewable 

source of human CEA protein that utilizes CEA-encoding nucleotide sequences that are optimized for 

expression in the intended host cell, said source allowing for the development of a cancer vaccine which 

is efficacious and not hindered by self -tolerance. 

15 

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity 
to the protein products expressed by CEA genes, which have been associated with numerous 
adenocarcinomas, including colorectal carcinomas. Specifically, the present invention provides 

20 polynucleotides encoding human CEA protein, wherein said polynucleotides are codon-optimized for 
high level expression in a human cell. The present invention further provides adenoviral and plasmid- 
based vectors comprising the synthetic polynucleotides and discloses use of said vectors in immunogenic 
compositions and vaccines for the prevention and/or treatment of CEA-associated cancer. 

The present invention also relates to synthetic nucleic acid molecules (polynucleotides) 

25 comprising a sequence of nucleotides that encode human carcinoembryonic antigen (hereinafter hCEA) 
as set forth in SEQ ID NO:2, wherein the synthetic nucleic acid molecules are codon-optimized for high- 
level expression in a human cell (hereinafter hCEAopt). The nucleic acid molecules disclosed herein may 
be transfected into a host cell of choice wherein the recombinant host cell provides a source for 
substantial levels of an expressed functional hCEA protein (SEQ ID NO:2). 

30 The present invention further relates to a synthetic nucleic acid molecule which encodes 

mRNA that expresses a human CEA protein; this DNA molecule comprising the nucleotide sequence 
disclosed herein as SEQ ID NO: 1 . A preferred aspect of this portion of the present invention is disclosed 
in FIGURE 1, which shows a DNA molecule (SEQ ID NO: 1) that encodes a hCEA protein (SEQ ID 
NO:2 or SEQ ID NO: 16). The preferred nucleic acid molecule of the present invention is codon- 

35 optimized for high-level expression in a human cell. 
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Another preferred DNA molecule of the present invention comprises a sequence of 
nucleotides that encodes a human CEA that is deleted of its C-tenninal anchoring domain (AD), which is 
located from about amino acid 679 to about amino acid 702 of the human full-length CEA (SEQ ED 
NO:2), wherein said sequence of nucleotides is codon-optimized for high level expression in a human 
cell. An exemplary DNA molecule encoding a CEA variant that is truncated of its anchoring domain is 
set forth in SEQ ID NO: 15 (shown in FIGURE 10A). The corresponding amino acid sequence of hCEA- 
AAD is set forth in SEQ ID NO: 16 (shown in FIGURE 10B). 

The present invention also relates to recombinant vectors and recombinant host cells, 
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this 
specification. 

The present invention further relates to a process for expressing a codon^ptimized 
human CEA protein in a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic 
acid molecule as set forth in SEQ ID NO: 1 or SEQ ID NO: 15 into a suitable host cell; and, (b) culturing 
the host cell under conditions which allow expression of said codon<>ptimized human protein. 

Another aspect of this invention is a method of preventing or treating cancer comprising 
adnnmstering to a mammal a vaccine vector comprising a synthetic nucleic acid molecule, the synthetic 
nucleic acid molecule comprising a sequence of nucleotides that encodes a human carcinoembryonic 
antigen (hCEA) protein as set forth in SEQ ID NO:2 or SEQ ID NO:16, wherein the synthetic nucleic 
acid molecule is codon-optimized for high level expression in a human cell. 

The present invention further relates to an adenovirus vaccine vector comprising an 
adenoviral genome with a deletion in the El region, and an insert in the El region, wherein the insert 
comprises an expression cassette comprising: (a) a codon-optimized polynucleotide encoding a human 
CEA protein; and (b) a promoter operably linked to the polynucleotide. 

The present invention also relates to a vaccine plasmid comprising a plasmid portion and 
an expression cassette portion, the expression cassette portion comprising: (a) a synthetic polynucleotide 
encoding a human CEA protein, wherein the synthetic polynucleotide is codon-optimized for optimal 
expression in a human cell; and (b) a promoter operably linked to the polynucleotide. 

Another aspect of the present invention is a method of protecting or treating a mammal 
from cancer or treating a mammal suffering from CEA-associated cancer comprising: (a) introducing into 
the mammal a first vector comprising: i) a codon-optimized polynucleotide encoding a human CEA 
protein; and ii) a promoter operably linked to the polynucleotide; (b) allowing a predetermined amount of 
time to pass; and (c) introducing into the mammal a second vector comprising: i) a codon-optimized 
polynucleotide encoding a human CEA protein; and ii) a promoter operably linked to the polynucleotide. 
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As used throughout the specification and in the appended claims, the singular forms "a," 
"an" and 'the" include the plural reference unless the context clearly dictates otherwise. 

As used throughout the specification and appended claims, the following definitions and 
abbreviations apply: 

5 The term "promoter" rc ^ ers to a recognition site on a DNA strand to which the RNA 

polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive 
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or 
inhibiting sequences termed "silencers". 

The term "cassette" refers to the sequence of the present invention that contains the 

10 nucleic acid sequence which is to be expressed. The cassette is similar in concept to a cassette tape; each 
cassette has its own sequence. Thus by interchanging the cassette, the vector will express a different 
sequence. Because of the restriction sites at the 5* and 3' ends, the cassette can be easily inserted, 
removed or replaced with another cassette. 

The term "vector" refers to some means by which DNA fragments can be introduced into 

15 a host organism or host tissue. There are various types of vectors including plasmid, virus (including 
adenovirus), bacteriophages and cosmids. 

The term "first generation," as used in reference to adenoviral vectors, describes said 
adenoviral vectors that are replication-defective. First generation adenovirus vectors typically have a 
deleted or inactivated El gene region, and preferably have a deleted or inactivated E3 gene region. 

20 The designation "pV 1 J/hCEAopt" refers to a plasmid construct disclosed herein 

comprising the human CMV immediate-early (BE) promoter with intron A, a full-length codon<>ptimized 
human CEA gene, bovine growth hormone-derived polyadenylation and transcriptional termination 
sequences, and a minimal pUC backbone (see EXAMPLE 2). The designation "pVU/hCEA" refers to a 
construct as described above, except the construct comprises a wild-type human CEA gene instead of a 

25 codon-optimized human CEA gene. 

The designations c< MRKAd5/hCEAopt" and "MRKAd5/hCEA" refer to two constructs, 
disclosed herein, which comprise an Ad5 adenoviral genome deleted of the El and E3 regions. In the 
"MRKAd5/hCEAopt" construct, the El region is replaced by a codon-optimized human CEA gene in an 
El parallel orientation under the control of a human CMV promoter without intron A, followed by a 

30 bovine growth hormone polyadenylation signal. The "MRKAd5/hCEA" construct is essentially as 
described above, except the El region of the Ad5 genome is replaced with a wild-type human CEA 
sequence (see EXAMPLE 2). 

The term "effective amount" means sufficient vaccine composition is introduced to 
produce the adequate levels of the polypeptide, so that an immune response results. One skilled in the art 

35 recognizes that this level may vary. 
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A "conservative amino acid substitution" refers to the replacement of one amino acid 
residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions 
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another, 
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine; 

5 glutamic acid for aspartic acid). 

"hCEA" and "bCEAopt" refer to a human carcinoembryonic antigen and a human codon- 

optimized carcinoembryonic antigen, respectively. 

The term "hCEA-AAD" refers to a variant of human CEA that is deleted of its C-terminal 
anchoring domain (AD), which is located from about amino acid 679 to about amino acid 702 of the 

10 human full-length CEA (SEQ ID NO:2). Nucleotide sequences encoding hCEA-AAD of the present 
invention are codon-optimized for high-level expression in a human cellular environment (designated 
herein hCEAopt-AAD"). An exemplary DNA molecule encoding a CEA variant that is truncated of its 
anchoring domain is setforth in SEQ ID NO:15 (shown in FIGURE 10A). The corresponding amino acid 
sequence of hCEA-AAD is set forth in SEQ ID NO:16 (shown in FIGURE 10B). Nucleotides encoding 

15 hCEA-AAD are useful for the development of a cancer vaccine for treatment and/or prophylaxis of 
cancer. 

The tram "mammalian" refers to any mammal, including a human being. 
The abbreviation "Ag" refers to an antigen. 

The abbreviations "Ab" and "mAb" refer to an antibody and a monoclonal antibody, 

20 respectively. 

The abbreviation "ORF' refers to the open reading frame of a gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows the nucleotide sequence of wild-type human CEA cDNA (SEQ ID 
25 NO:3) and of the codon optimized clone (hCEAopt, SEQ ID NO:l). The deduced amino acid sequence is 
shown on top (SEQ ID NO:2). The substituted nucleotides of the synthetic codon optimized cDNA are 
shown below the hCEAcDNA sequence. See EXAMPLE 2. 

FIGURE 2 shows the expression of hCEA in injected mice. Groups of 10 C57BL/6 mice 
were injected in the quadriceps muscle either with various doses of MRKAd5-hCEA and MRKAd5- 
30 hCEAopt (Panel A) or with 25 or 50 micrograms of plasmids pVU/hCEA and pVU/hCEAopt (Panel B). 
Blood samples were collected 3 days postinjection and CEA levels were measured. Filled triangles 
represent CEA measurement of individual mice. Geometric mean values are also shown (filled circle). 

FIGURE 3 shows that codon optimization increases the immune response to human 
CEA. Groups of 8 C57BL/6 mice were injected via the quadriceps muscle either with various doses of 
35 MRKAd5-hCEA and MRKAd5-hCEAopt Virus injections were carried out at 0 and 21 days. Panel A. 
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At two weeks post boosting injection, the number of CD8+ IFNy secreting T cells specific for hCEA was 
determined by ELISPOT assay on splenocytes from individual mice (filled triangles) using peptide 143 
that covers aa 569-583 and includes a CD8+ epitope. Two different amounts of splenocytes (2.5 x 105 
and 5 x 105) and two replicas of each tested amount of splenocytes. Average values were calculated by 
5 subtracting the background level determined in the absence of peptides (typically less than 10 SFC/10° 
total splenocytes), and the results were expressed as the number of SFC/106 total splenocytes. Values 
from individual mice are shown (filled triangles) as well as the geometric mean values (filled circle). 
Panel B. Anti-CEA antibody titers in sera from individual mice (filled triangles) were measured using 10 
days post boost serum samples. Geometric mean titers (GMT) (filled circles) are also shown. 
10 Ad/hCEAopt is significandy different from Ad/hCEA 

FIGURE 4. Comparison of different immunization regimens. Groups of C57BL/6 (A) or 
BALB/c (B) mice were immunized with different combinations of plasmid pVU/hCEA (50ug/dose 
electroinjected in the quadriceps muscle) and MRKAd5/hCEA ( 1x109 pp/dose). The number of IFNy- 
secreting T cells in splenocytes in each individual mouse was determined using a pool of peptides 
15 covering aa 497-703 (pool D) as described in materials and methods and in the legend to FIGURE 3. 
Geometric mean values are also shown (filled circles). D/D and D/A are significantly different from 
Ad/Ad group in C57BL/6 mice. All three groups are significandy different in BALB/c mice. 

FIGURE 5 shows the results of mapping of T-cell responses to selected regions of the 
hCEA protein. Groups of C57BL/6 (Panel A) or BALB/c (Panel B) mice were immunized with 50ng of 
20 plasmid pVU/hCEA and boosted three weeks later with lxlO? pp of Ad/hCEA. The number of IFNy- 
secreting T cells in splenocytes in each individual mouse was determined two weeks post-boost using 
pool of peptides covering the entire protein as described in materials and methods and in the legend to 
Figure 3. Geometric mean values are also shown (filled circles). 

FIGURE 6. Identification of immunoresponsive peptides of hCEA. Pooled splenocytes 
25 from 4 immunized C57BL/6 (Panel A) or BALB/c (Panel B) mice were assayed for IFNy secretion 
against each indicated peptide by ELISPOT assay (see EXAMPLE 8). 

FIGURE 7 shows the sequence of epitope containing peptides for CEA in C57BL/6 mice 
(Panel A) and BALB/c mice (Panel B) (see EXAMPLE xx). Listed to the right are the percent of 
IFNy producing CD8+ (CD4+) CD3+ cells. 
30 FIGURE 8 shows results from an IFNv-ELISPOT assay of immunized CEA transgenic 

mice as described in EXAMPLE 9. Mice were immunized with four electroinjections of plasmid DNA 
one week apart plus one adenovirus injection. For each immunogen, data were obtained with pooled 
splenocytes of three injected mice. The CD8-specific response was measured using peptide 143. 

FIGURE 9 shows IFNy -intracellular staining of immunized CEA.tg mice. Mice were 
35 immunized with 2 injections of lxlQlO vp of Adenovirus two weeks apart. Shown are the data obtained 
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with pooled splenocytes of three injected mice. Listed to the right are the percent of CD8+ or CD4+ 
cells. 

FIGURE 10, Panel A, shows an exemplary codon-optimized DNA molecule encoding a 
CEA variant that is truncated of its anchoring domain as set forth in SEQ ID NO: 15. The corresponding 
5 amino acid sequence of hCEA-AAD is shown in Panel B (SEQ ID NO: 16). 

DETAILED DESCRIPTION OF THE INVENTION 

Carcinoembryonic antigen (CEA) is commonly associated with the development of 
adenocarcinomas. The present invention relates to compositions and methods to elicit or enhance 

10 immunity to the protein product expressed by the CEA tumor-associated antigen, wherein aberrant CEA 
expression is associated with the carcinoma or its development Association of aberrant CEA expression 
with a carcinoma does not require that the CEA protein be expressed in tumor tissue at all hmepoints of 
its development, as abnormal CEA expression may be present at tumor initiation and not be detectable 
late into tumor progression or vice-versa. 

!5 To this end, synthetic DNA molecules encoding the human CEA protein are provided. 

The codons of the synthetic molecules are designed so as to use the codons preferred by the projected 
host cell, which in preferred embodiments is a human cell. The synthetic molecules may be used for the 
development of recombinant adenovirus or plasmid-based vaccines, which provide effective 
immunoprophylaxis against CEA-associated cancer through neutralizing antibody and cell-mediated 

20 immunity. The synthetic molecules may be used as an immunogenic composition. This invention 

provides polynucleotides which, when directly introduced into a vertebrate in vivo, including mammals 
such as primates and humans, induce the expression of encoded proteins within the animal. 

The wild-type human CEA nucleotide sequence has been reported (See, e.g., U.S. Patent 
No. 5,274,087; U.S. Patent No 5,571,710; and U.S. Patent No 5,843,761). The present invention provides 

25 synthetic DNA molecules encoding the human CEA protein. The synthetic molecules of the present 

invention comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to 
use the codons preferred by a human cell, thus allowing for high-level expression of CEA in a human host 
cell. The synthetic molecules may be used as a source of CEA protein, which may be used in a cancer 
vaccine to provide effective immunoprophylaxis against CEA-associated carcinomas through neutralizing 

30 antibody and cell-mediated immunity. 

A "triplet" codon of four possible nucleotide bases can exist in over 60 variant forms. 
Because these codons provide the message for only 20 different amino acids (as well as transcription 
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon 
known as codon redundancy. For reasons not completely understood, alternative codons are not 

35 uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a 
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variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example, 
the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA, 
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed 
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA 
5 of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. Inviewofthis 
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a leucine- 
rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For 
example, it is likely that a gene rich in TTA codons will be poorly expressed in E coli, whereas a CTG 
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a 
10 leucine-rich polypeptide in yeast host cells would be TTA. 

The implications of codon preference phenomena on recombinant DNA techniques are 
manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels 
of exogenous genes in successfully transformed host organisms-a less "preferred" codon may be 
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as 
15 efficiently. This phenomenon suggests that synthetic genes which have been designed to include a 

projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of 
recombinant DNA techniques. Thus, one aspect of this invention is a human CEA gene that is codon- 
optimized for expression in a human cell. In a preferred embodiment of this invention, it has been found 
that the use of alternative codons encoding the same protein sequence may remove the constraints on 
20 expression of exogenous CEA protein in human cells. 

In accordance with this invention, the human CEA gene sequence was converted to a 
polynucleotide sequence having an identical translated sequence but with alternative codon usage as 
described by Lathe, "Synthetic Oligonucleotide Probes Deduced from Amino Acid Sequence Data: 
TheoreticalandPractical Considerations" J. Molec. Biol. 183:1-12 (1985), whichis hereby incorporated 
25 by reference. The methodology generally consists of identifying codons in the wild-type sequence that 

are not commonly associated with highly expressed human genes and replacing them with optimal codons 
for high expression in human cells. The new gene sequence is then inspected for undesired sequences 
generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent creation of intron splice 
recognition sites, unwanted restriction enzyme sites, etc.). Undesirable sequences are elirninated by 
30 substitution of the existing codons with different codons coding for the same amino acid. The synthetic 
gene segments are then tested for improved expression. 

The methods described above were used to create synthetic gene sequences for human 
CEA, resulting in a gene comprising codons optimized for high level expression. While the above 
procedure provides a summary of our methodology for designing colon-optimized genes for use in 
35 cancer vaccines, it is understood by one skilled in the art that similar vaccine efficacy or increased 
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expression of genes may be achieved by minor variations in the procedure or by minor variations in the 
sequence. One of skill in the art will also recognize that additional DNA molecules may be constructed 
that provide for high levels of CEA expression in human cells, wherein only a portion of the codons of 
the DNA molecules are ccdonK)ptirnized. 

Accordingly, the present invention relates to a synthetic polynucleotide comprising a 
sequence of nucleotides encoding a human CEA protein (SEQ ID NO:2), or a biologically active 
fragment or mutant form of a human CEA protein, including, but not limited to hCEA-AAD (SEQ ED 
NO:16), the polynucleotide sequence comprising codons optimized for expression in a human host Said 
mutant forms of the CEA protein include, but are not limited to: conservative amino acid substitutions, 
amino-terminal truncations, carboxy-tenninal truncations, deletions, or additions, collectively referred to 
herein as 'Variants". Any such biologically active fragment and/or mutant will encode either a protein or 
protein fragment which at least substantially mimics the immunological properties of the CEA protein as 
set forth in SEQ ID NO:2. The synthetic polynucleotides of the present invention encode mRNA 
molecules that express a functional human CEA'protein so as to be useful in the development of a 
therapeutic or prophylactic cancer vaccine. 

As stated above, the present invention relates to nucleotides encoding a human CEA 
protein (SEQ ID NO:2), or a biologically active fragment or mutant form thereof. To this end, the present 
invention provides nucleotides encoding hCEA-AAD (SEQ ID NO:16, FIGURE 10B), which comprises a 
human CEA protein that is deleted of its C-terminal anchoring sequence. The nucleic acid molecules of 
the present invention encoding hCEA-AAD are codon-optimized for enhanced expression in human cells. 
An exemplary nucleic acid molecule encoding hCEA-AAD comprises a sequence of nucleotides as set 
forth in SEQ ID NO: 15 (FIGURE 10A). 

The present invention relates to an synthetic nucleic acid molecule (polynucleotide) 
comprising a sequence of nucleotides which encodes mRNA that expresses a novel hCEA protein as set 
forth in SEQ ID NO:2, wherein the synthetic nucleic acid molecule is ccKlonH3ptirnized for high-level 
expression in a human host cell. The nucleic acid molecules of the present invention are substantially 
free from other nucleic acids. 

The present invention also relates to recombinant vectors and recombinant host cells, 
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this 
specification. The synthetic DNA molecules, associated vectors, and hosts of the present invention are 
useful for the development of a cancer vaccine. 

A preferred DNA molecule of the present invention comprises the nucleotide sequence 
disclosed herein as SEQ ID NO: 1, shown in FIGURE 1, which encodes the human CEA protein shown in 
FIGURE 2 and set forth as SEQ ID NO:2. 
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A further preferred DNA molecule of the present invention comprises the nucleotide 
sequence disclosed herein as SEQ ID NO: 15, shown in FIGURE 10A, which encodes a human CEA 
variant that is deleted of its C4errninal anchoring sequence, as set forth in SEQ ID NO: 1 6, and shown in 
FIGURE 10B. 

The present invention also includes biologically active fragments or mutants of SEQ ID 
NOs: 1, which encode mRNA expressing human CEA proteins. Any such biologically active fragment 
and/or mutant will encode either a protein or protein fragment which at least substantially rmmics the 
pharmacological properties of the h(^ protein, including but not limited to the hCEA protein as set 
forth in SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to: nucleotide 
substitutions, deletions, additions, arnino-terminal truncations and carboxy-terrininal truncations. The 
mutations of the present invention encode mRNA molecules that express a functional hCEA protein in a 
eukaryotic cell so as to be useful in cancer vaccine development. 

This invention also relates to synthetic codon-optimized DNA molecules that encode the 
hCEA protein wherein the nucleotide sequence of the synthetic DNA differs significantly from the 
nucleotide sequence of SEQ ID NO: 1, but still encodes the hCEA protein as set forth in SEQ ID NO:2. 
Such synthetic DNAs are intended to be within the scope of the present invention. Therefore, the present 
invention discloses codon redundancy that may result in numerous DNA molecules expressing an 
identical protein. Also included within the scope of this invention are mutations in the DNA sequence 
that do not substantially alter the ultimate physical properties of the expressed protein. For example, 
substitution of valine for leucine, arginine for lysine, or asparagine for glutamine may not cause a change 
in the functionality of the polypeptide. 

It is known that DNA sequences coding for a peptide may be altered so as to code for a 
peptide that has properties that are different than those of the naturally occurring peptide. Methods of 
altering the DNA sequences include but are not limited to site directed mutagenesis. Examples of altered 
properties include but are not limited to changes in the affinity of an enzyme for a substrate or receptor 
for a ligand. 

The present invention also relates to hCEAopt fusion constructs, including but not limited 
to fusion constructs which express a portion of the human CEA protein linked to various markers, 
including but in no way limited to GFP (Green fluorescent protein), the MYC epitope, GST, and Fc. Any 
such fusion construct may be expressed in the cell line of interest and used to screen for modulators of the 
human CEA protein disclosed herein. Also contemplated are fusion constructs that are constructed to 
enhance the immune response to human CEA including, but not limited to: DOM and hs P 70, and LTB. 

The present invention further relates to recombinant vectors that comprise the synthetic 
nucleic acid molecules disclosed throughout this specification. These vectors may be comprised of DNA 
or RNA. For most cloning purposes, DNA vectors are preferred. Typical vectors include plasmids, 
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modified viruses, baculovirus, bacteriophage, cosmids, yeast artificial chromosomes, and other forms of 
episomal or integrated DNA that can encode a hCEA protein. It is well within the purview of the skilled 
artisan to determine an appropriate vector for a particular gene transfer or other use. 

An expression vector containing codon-optimized DNA encoding a hCEA protein may 
be used for high-level expression of hCEA in a recombinant host cell. Expression vectors may include, 
but are not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids or 
viruses. Also, a variety of bacterial expression vectors may be used to express recombinant hCEA in 
bacterial cells if desired. In addition, a variety of fungal cell expression vectors may be used to express 
recombinant hCEA in fungal cells. Further, a variety of insect cell expression vectors may be used to 
express recombinant protein in insect cells. 

The present invention also relates to host cells transformed or transfected with vectors 
comprising the nucleic acid molecules of the present invention. Recombinant host cells may be 
prokaryotic or eukaryotic, including but not limited to, bacteria such as E. coli, fungal cells such as yeast, 
mammalian cells including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin; 
and insect cells including but not limited to Drosophila and silkworm derived cell lines. Such 
recombinant host cells can be cultured under suitable conditions to produce hCEA or a biologically 
equivalent form. In a preferred embodiment of the present invention, the host cell is human. As defined 
herein, the term "host cell" is not intended to include a host cell in the body of a transgenic human being, 
human fetus, or human embryos. 

As noted above, an expression vector containing DNA encoding a hCEA protein may be 
used for expression of hCEA in a recombinant host cell. Therefore, another aspect of this invention is a 
process for expressing a human CEA protein or protein variant in a recombinant host cell, comprising: (a) 
introducing a vector comprising a nucleic acid as set forth in SEQ ID NO.l or SEQ ID NO:15 into a 
suitable human host cell; and, (b) culturing the host cell under conditions which allow expression of said 
human CEA protein or CEA protein variant. 

Following expression of hCEA in a host cell, hCEA protein may be recovered to provide 
hCEA protein in active form. Several hCEA protein purification procedures are available and suitable for 
use. Recombinant hCEA protein may be purified from cell lysates and extracts by various combinations 
of, or individual application of salt fractionation, ion exchange chromatography, size exclusion 
chromatography, hydroxylapatite adsorption chromatography and hydrophobic interaction 
chromatography. In addition, recombinant hCEA protein can be separated from other cellular proteins by 
use of an immunoaffinity column made with monoclonal or polyclonal antibodies specific for full-length 
hCEA protein, or polypeptide fragments of hCEA protein. 

The nucleic acids of the present invention may be assembled into an expression cassette 
which comprises sequences designed to provide for efficient expression of the protein in a human cell. 
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The cassette preferably contains a full-length codon-optirnized hCEA gene, with related transcriptional 
and translations control sequences operatively linked to it, such as a promoter, and termination sequences. 
In a preferred embodiment, the promoter is the cytomegalovirus promoter without the intron A sequence 
(CMV), although those skilled in the art will recognize that any of a number of other known promoters 
such as the strong immunoglobulin, or other eukaryotic gene promoters may be used. A preferred 
transcriptional terminator is the bovine growth hormone terminator, although other known transcriptional 
terminators may also be used. The combination of CMV-BGH terminator is particularly preferred. 

Li accordance with this invention, the hCEAopt expression cassette is inserted into a 
vector. The vector is preferably an adenoviral vector, although linear DNA linked to a promoter, or other 
vectors, such as adeno-associated virus or a modified vaccinia virus, retroviral or lentiviral vector may 
also be used. 

If the vector chosen is an adenovirus, it is preferred that the vector be a so-called first- 
generation adenoviral vector. These adenoviral vectors are characterized by having a non-functional El 
gene region, and preferably a deleted adenoviral El gene region. In some embodiments, the expression 
cassette is inserted in the position where the adenoviral El gene is normally located. In addition, these 
vectors optionally have a non-functional or deleted E3 region. It is preferred that the adenovirus genome 
used be deleted of both the El and E3 regions (AE1AE3). The adenoviruses can be multiplied in known 
cell lines which express the viral El gene, such as 293 cells, or PERC.6 cells, or in cell lines derived from 
293 or PERC.6 cell which are transiently or stablily transformed to express an extra protein. For 
examples, when using constructs that have a controlled gene expression, such as a tetracycline regulatable 
promoter system, the cell line may express components involved in the regulatory system. One example 
of such a cell line is T-Rex-293; others are known in the art 

For convenience in manipulating the adenoviral vector, the adenovirus may be in a 
shuttle plasmid form This invention is also directed to a shuttle plasmid vector which comprises a 
plasmid portion and an adenovirus portion, the adenovirus portion comprising an adenoviral genome 
which has a deleted El and optional E3 deletion, and has an inserted expression cassette comprising 
codon-optrarized human CEA. In preferred embodiments, there is a restriction site flanking the 
adenoviral portion of the plasmid so that the adenoviral vector can easily be removed. The shuttle 
plasmid may be replicated in prokaryotic cells or eukaryotic cells. 

In a preferred embodiment of the invention, the expression cassette is inserted into the 
pMRKAd5-HV0 adenovirus plasmid (See Emini et al, WO 02/22080, which is hereby incorporated by 
reference). This plasmid comprises an Ad5 adenoviral genome deleted of the El and E3 regions. The 
design of the P MRKAd5-HV0 plasmid was improved over prior adenovectors by extending the 5' cis- 
acting packaging region further into the El gene to incorporate elements found to be important in 
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optimizing viral packaging, resulting in enhanced virus amplification. Advantageously, this enhanced 
adenoviral vector is capable of maintaining genetic stability following high passage propagation. 

Standard techniques of molecular biology for preparing and purifying DNA constructs 
enable the preparation of the adenoviruses, shuttle plasmids, and DNA immunogens of this invention. 
5 It has been determined in accordance with the present invention that the synthetic cDNA 

molecule described herein (SEQ ED NO: 1), which is codon-optimized for high-level expression in a 
human cell, is expressed with greater efficiency than the corresponding wild type sequence. Surprisingly, 
the codon optimized cDNA of hCEA breaks tolerance to hCEA more efficiently than the wild type 
sequence. Additionally, it was shown herein that hCEAopt is more immunogenic that hCEA and is more 
10 efficient m eliciting both cellular and humoral immune responses. 

Therefore, the vectors described above may be used in immunogenic compositions and 
vaccines for preventing the development of adenocarcinomas associated with aberrant CEA expression 
and/or for treating existing cancers. The vectors of the present invention allow for vaccine development 
and commercialization by eliminating difficulties with obtaining high expression levels of exogenous 
15 CEA in successfully transformed host organisms. To this end, one aspect of the instant invention is a 
method of preventing or treating cancer comprising administering to a mammal a vaccine vector 
comprising a synthetic colon-optimized nucleic acid molecule, the synthetic codonnoptimized nucleic 
acid molecule comprising a sequence of nucleotides that encodes a human CEA protein as setforth in 
SEQ ID NO:2. 

20 m accordance with the method described above, the vaccine vector may be administered 

for the treatment or prevention of cancer in any mammal. In a preferred embodiment of the invention, the 
mammal is a human. 

Further, one of skill in the art may choose any type of vector for use in the treatment and 
prevention method described. Preferably, the vector is an adenovirus vector or a plasmid vector. In a 
25 preferred embodiment of the invention, the vector is an adenoviral vector comprising an adenoviral 

genome with a deletion in the adenovirus El region, and an insert in the adenovirus El region, wherein 
the insert comprises an expression cassette comprising: (a) a synthetic codon-optimized polynucleotide 
encoding a human CEA protein; and (b) a promoter operably linked to the polynucleotide. 

The instant invention further relates to an adenovirus vaccine vector comprising an 
30 adenoviral genome with a deletion in the El region, and an insert in the El region, wherein the insert 
comprises an expression cassette comprising: (a) a synthetic codon-optimized polynucleotide encoding a 
human CEA protein; and (b) a promoter operably linked to the polynucleotide. 

In a preferred embodiment of this aspect of the invention, the adenovirus vector is an Ad 

5 vector. 
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In another preferred embodiment of the invention, the adenovirus vector is an Ad 6 

vector. 

In yet another preferred embodiment, the adenovirus vector is an Ad 24 vector. 

In another aspect, the invention relates to a vaccine plasmid comprising a plasmid portion 
and an expression cassette portion, the expression cassette portion comprising: (a) a synthetic codon- 
optinrized polynucleotide encoding a human CEA protein or variant thereof; and (b) a promoter operably 

linked to the polynucleotide. 

la some embodiments of this invention, the recombinant adenovirus vaccines disclosed 
herein are used in various prime/boost combinations with a plasmid-based polynucleotide vaccine in 
order to induce an enhanced immune response. In this case, the two vectors are administered in a "prime 
and boost" regimen. For example the first type of vector is adnmristered, then after a predeterrnined 
amount of time, for example, 2 weeks, 1 month, 2 months, six months, or other appropriate interval, a 
second type of vector is administered. Preferably the vectors carry expression cassettes encoding the 
same polynucleotide or combination of polynucleotides. In the embodiment where a plasmid DNA is 
also used, it is preferred that the vector contain one or more promoters recognized by mammalian or 
insect cells. In a preferred embodiment, the plasmid would contain a strong promoter such as, but not 
limited to, the CMV promoter. The synthetic human CEA gene or other gene to be expressed would be 
linked to such a promoter. An example of such a plasmid would be the mammaUan expression plasmid 
VlJns as described (J. Shiver et. al. in DNA Vaccines, M. Liu et al. eds., N.Y. Acad. Sci., N.Y., 772:198- 
208 (1996), which is herein incorporated by reference). 

As stated above, an adenoviral vector vaccine and a plasmid vaccine may be 
administered to a vertebrate as part of a single therapeutic regime to induce an immune response. To this 
end, the present invention relates to a method of protecting a mammal from cancer comprising: (a) 
introducing into the mammal a first vector comprising: i) a synthetic codon-optimized polynucleotide 
encoding a human CEA protein or human CEA protein variant; and ii) a promoter operably linked to the 
polynucleotide; (b) allowing a predetermined amount of time to pass; and (c) introducing into the 
mammal a second vector comprising: i) a synthetic codon-optimized polynucleotide encoding a human 
CEA protein or human CEA protein variant; and ii) a promoter operably linked to the polynucleotide. 

In one embodiment of the method of protection described above, the first vector is a 
plasmid and the second vector is an adenovirus vector. In an alternative embodiment, the first vector is 
an adenovirus vector and the second vector is a plasmid. 

The instant invention further relates to a method of treating a mammal suffering from an 
adenocarcinoma comprising: (a) introducing into the mammal a first vector comprising: i) a synthetic 
codon-optimized polynucleotide encoding a human CEA protein or human CEA protein variant; and ii) a 
promoter operably linked to the polynucleotide; (b) allowing a predetermined amount of time to pass; and 
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(c) introducing into the mammal a second vector comprising: i) a synthetic codon-optiniized 
polynucleotide encoding a human CEA protein or human CEA protein variant; and ii) a promoter 
operably linked to the polynucleotide. 

In one embodiment of the method of treatment described above, the first vector is a 
plasmid and the second vector is an adenovirus vector. In an alternative embodiment, the first vector is 
an adenovirus vector and the second vector is a plasmid. 

The amount of expressible DNA or transcribed RNA to be introduced into a vaccine 
recipient will depend partially on the strength of the promoters used and on the immunogenicity of the 
expressed gene product. In general, an immunologically or prophylactically effective dose of about 1 ng 
to 100 mg, and preferably about 10 jig to 300 ug of a plasmid vaccine vector is administered directly into 
muscle tissue. An effective dose for recombinant adenovirus is approximately 106 - 10*2 particles and 
preferably about 10?— lOUparticles. Subcutaneous injection, intradermal introduction, impression 
though the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation 
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided. 
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of 
administration with adjuvants such as interleukin 12 protein, concurrently with or subsequent to 
parenteral introduction of the vaccine of this invention is also advantageous. 

The vaccine vectors of this invention may be naked, i.e., unassociated with any proteins, 
adjuvants or other agents which impact on the recipient's immune system. In this case, it is desirable for 
the vaccine vectors to be in a physiologically acceptable solution, such as, but not limited to, sterile saline 
or sterile buffered saline. Alternatively, it may be advantageous to administer an immunostimulant, such 
as an adjuvant, cytokine, protein, or other carrier with the vaccines or immunogenic compositions of the 
present invention. Therefore, this invention includes the use of such irnmunostimulants in conjunction 
with the compositions and methods of the present invention. An immunostimulant, as used herein, refers 
to essentially any substance that enhances or potentiates an immune response (antibody and/or cell- 
mediated) to an exogenous antigen. Said irnmunostimulants can be administered in the form of DNA or 
protein. Any of a variety of irnmunostimulants may be employed in conjunction with the vaccines and 
immunogenic compositions of the present inventions, including, but not limited to: GM-CSF, BFNa, 
tetanus toxoid, IL12, B7.1, LFA-3 and ICAM-1. Said irnmunostimulants are well-known in the art. 
Agents which assist in the cellular uptake of DNA, such as, but not limited to calcium ion, may also be 
used. These agents are generally referred to as transfection facilitating reagents and phannaceutically 
acceptable carriers. Those of skill in the art will be able to determine the particular immunostimulant or 
phannaceutically acceptable carrier as well as the appropriate time and mode of administration. 
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All publications mentioned herein are incorporated by reference for the purpose of 
describing and disclosing methodologies and materials that might be used in connection with the present 
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate 
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled in the art 
without departing from the scope or spirit of the invention as defined in the appended claims. 
The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Human CEA optimized codon sequence. 

The entire hCEAopt coding sequence was synthesized and assembled by BIONEXUS 
(Oakland, CA). The hCEAopt cDNA, which carries an optimized Kozak sequence at its 5'-end, was 
constructed using oligonucleotides assembled by PCR. The assembled cDNA was inserted into the pCR- 
Blunt vector (Invitxogen, Carlsbad, CA), yielding pCR-hCEAopt. The integrity of the hCEAopt cDNA 
was determined by sequencing of both strands. 

EXAMPLE 2 

20 Plasmid Constructs and Adenovirus vectors. 

pVU/hCEAopt : Plasmid pCR-hCEAopt was digested with EcoRL for 1 hr at 37°C. The 
resulting 2156 bp insert was purified and cloned into the EcoRL site of plasmid pVUnsB (Montgomery, et 
al., DNA CellBioU 12(9):777-83(1993)). 

pVU/hCEA : Plasmid pCI/hCEA (Song et al. Regulation of T-helper-1 versus T-helper- 
25 2 activity and enhancement of tumour immunity by combined DNA-based vaccination and nonviral 

cytokine gene transfer. Gen^77ier^y7:48M92(2000))wasalgestedwim^RI. The resulting 2109 
bp insert was cloned into the EcoRL site of plasmid pVUnsA (Montgomery et al., supra). 

Ad5/hCBAopt : Plasmid pCR-hCEAopt was digested with EcoRL. The resulting 2156 bp 
insert was purified and cloned into the EcoKl of the polyMRK-AdS shuttle plasmid {See Emini et al., 
30 WO 02/22080, which is hereby incorporated by reference). 

Ad5/CEA : The shuttle plasmid pMRK-hCEA for generation of Ad5 vector was obtained 
by digesting plasmid pDeltalsplB/hCEA with Sspl and EcoRV. The 9.52 kb fragment was then ligated 
with a 1272 bp BgnL-BamHl restricted, Klenow treated product from plasmid polyMRK. A PacVStul 
fragment from pMRK-hCEA and pMRK-hCEAopt containing the expression cassette for hCEA and El 
35 flanking Ad5 regions was recorabined to Clal linearized plasmid pAd5 in BJ5183 E. colt cells. The 
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resulting plasmids were pAd5-hCEA and pAd5-hCEAopt, respectively. Both plasmids were cut with 
Pacl to release the Ad inverted terminal repeats (TTRs) and transfected in PerC-6 cells. Ad5 vectors 
amplification was carried out by serial passage. MRKAdS/hCEA and MRKAd5/hCEAopt were punfied 
through standard CsCl gradient purification and extensively dialyzed against A105 buffer (5mM Tns-Cl 
5 pH 8.0, ImM MgCl 2 , 75 mM NaCl, 5% Sucrose, 0.005 Tween 20). 

EXAMPLE 3 

CEA Expression and Detection. 

Expression of hCEA by the plasmid and Ad vectors was monitored by Western blot 

10 analysis. Plasmids were transfected in HeLa cells or PerC.6 cells with Upofectamine 2000 (Life 
Technologies, Carlsbad, CA). Adenovirus infections of PerC6 cells were performed in serum free 
medium for 30 min at 37» C then fresh medium was added. After 48 br incubation, whole cell lysates 
and culture supernatant were harvested- The CEA protein present in the cell lysates was detected by 
Western blot analysis using a rabbit polyclonal antiserum. The protein was detected as a 180-200 kDa 

15 band The secreted CEA was detected in the cell supernatants and in peripheral blood of injected mice (3 
days post injection) using the Direct Elisa CEA Kit (DBC-Diagnostics Biochem Canada Inc., Ontario, 
Canada). 

EXAMPLE 4 

20 Mice immunization. 

Female C57BL/6 mice (H-2b) were purchased from Charles River (Lecco, Italy). 

CEA.tg mice (H-2b) were provided by J. Primus (Vanderbilt University) and kept in standard conditions. 

Fifty micrograms of plasmid DNA were electroinjected in a 50ul volume in mice quadriceps as 

previously described (Rizzuto et al. Proc. Natl Acad. Set U.S.A. 96(11): 6417-22 (1999)). Ad injections 
25 were carried out in mice quadriceps in 50 volume. Humoral and cell mediated immune response were 

analyzed at the indicated time. 

EXAMPLE 5 

Codon optimized cDNA of hCEA significantly increased hCEA expression. 

30 A synthetic gene of human CEA (hCEAopt) was designed to incorporate human- 

preferred (humanized) codons for each amino acid (hereinafter aa) residue. The codon optimized cDNA 
was modified to maintain 76.8% nucleotide identity to the original clone (see FIGURE 1). The codon 
optimized cDNAs were cloned into the pVU vectors (Montgomery et al., supra), placing in front a Kozak 
optimized sequence (5'-GCCGCCACC-3', SEQ ID NO:13) and under the control of the human 

35 cytomegalovirus (CMNOfrntron A promoter plus the bovine growth hormone (BGH) termination signal. 
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The construct was named pVU/hCEAopt (see EXAMPLE 2). Additionally, an Adenovirus type 5 vector 
was constructed carrying the hCEAopt sequence flanked by the CMV/intron A promoter and the BGH 
tennination signal (Ad5/hCEAopt). For comparison, the equivalent plasmid and Ad5 vectors were 
constructed carrying the wild type hCEA sequence yielding pVl J/hOBA and Ad5/hCEA. Similar to those 

5 containing the codon optimized cDNA, these vectors carry the wild type gene under the control of the 
CMV/int A promoter with BGH termination signal. 

Western blot analysis of HeLa cells transfected with plasmid pVl J/hCEAopt yielded a 
protein with large molecular mass (180-200 kDa) that was indistinguishable in size from that detected in 
cells transfected with construct pVU/hCEA. Similarly, no apparent differences could be detected in the 

10 size of the protein detected in PerC-6 cell lysates that had been infected with Ad5/hCEA or 
Ad5H7hCEAopt (data not shown). 

To compare the efficiency of expression of the hCEAopt to that of hCEA, groups of 10 
C57BL/6 mice were injected into the quadriceps with different doses of the Ad5/hCEAopt vector ranging 
from 1x107 to 1x104 pfu. Three days post injection, CEA protein levels were determined and compared 

15 to those of control groups that had been injected with the same doses of Ad5/hCEA. A sixfold increase in 
the geometric mean values of hCEA levels was observed upon injection of lxlO 7 pfu of Ad/hCEAopt 
(48.2 jig/1) relative to the Ad5/hCEA injected mice, whereas a tenfold increase in protein level was 
observed upon injection of 1x106 pfu of the same virus (19.1 jxg/1) (FIGURE 2A). In contrast, injection 
of lower doses of Ad5/hCEAopt did not result in a substantial increase in circulating CEA levels as 

20 compared to Ad5/hCEA. The enhancement of CEA protein levels was also noted, albeit to a lower 

extent, upon electroinjection of 25 or 50 \ig of plasmid pVU/hCEAopt relative to pVU/hCEA (FIGURE 
2B). Thus, these results indicate that, independently of the gene transfer vehicle utilized, the codon 
optimized cDNA is expressed with greater efficiency than the corresponding wild type sequence. 

25 EXAMPLE 6 

IFN-y ELISPOT Assay. 

Ninety-six wells MAIP plates (Millipore, Bedford, MA) were coated with 100 \iV well of 
purified rat anti-mouse IFN-y (IgGl, clone R4-6A2, Pharmingen, San Diego, CA) diluted to 2.5 Jig/ml in 
sterile PBS. After washing with PBS, blocking of plates was carried out with 200 jutl/well of R10 medium 

30 for2hrsat3rC. 

Splenocytes were obtained by removing the spleen from the euthanized mice in a sterile 
manner. Spleen disruptionwas carried out by grating the dissected spleen on a metal grid. Red blood 
cells were removed by osmotic lysis by adding 1 ml of 0. IX PBS to the cell pellet and vortexing no more 
than 15 seconds. One ml of 2X PBS was then added and the volume was brought to 4 ml with IX PBS. 
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Cells were pelleted by centrifugation at 1200 rpm for 10 min at room temp., and the pellet was 
resuspended in 1 ml R10 medium. Viable cells were counted using Turks staining. 

Splenocytes were plated at 5x10^ and 2xl0 5 cells/well in duplicate and incubated for 20 
h at 37°C with l^ig/ml suspension of each peptide. Concanavalin A (ConA) was used as positive internal 
5 control for each mouse at 5 ^ig/ml. After washing with PBS, 0.05% Tween 20, plates were incubated O/N 
at 4°C with 50 ul/well of biotin-conjugated rat anti-mouse IFNy (RatlgGl, clone XMG 1.2, PharMingen) 
diluted to 1:2500 in Assay buffer. After extensive washing, plates were developed by adding 50 uj/well 
NBT/B-CIP (Pierce Biotechnology Inc., Rockford, IL) until development of spots was clearly visible. 
The reaction was stopped by washing plates thoroughly with distilled water. Plates were air dried and 
10 spots were then counted using an automated EUSPOT reader. 

EXAMPLE 7 

Intracellular Cytokine Staining. 

One to two million mouse splenocytes or PBMC in 1ml RPMI 10% FCS were incubated 

15 with a pool of peptides (5-6 u,g/ml final concentration of each peptide) and brefeldin A (1 Jig/ml; BD 
Pharmingen cat #555028/2300kk) and 5% CO2 for 12-16 hours at 37°C. Cells were then washed with 
FACS buffer (PBS 1% FBS, 0.01% NaN3) and incubated with purified anti-mouse CD16/CD32 Fc block 
(BD Pharmingen cat # 553142) for 15 min at 4°C. Cells were then washed and stained with surface 
antibodies: CD4-PE conjugated anti-mouse (BD Pharmingen, cat.# 553049), PercP CD8 conjugated anti 

20 mouse (BD Pharmingen cat# 553036) and APC- conjugated anti-mouse CD3e (BD Pharmingen cat# 
553066) for 30 minutes at room temperature in the dark. After the washing, cells were fixed and 
permeabilized with Cytofix-Cytoperm Solution (BD Pharmingen cat #555028/2300kk) for 20 min at 4°C 
in the dark. After washing with PermWash Solution (BD Pharmingen cat #555028/2300kk) cells were 
incubated with the IFNy-FITC antibodies (BD Pharmingen). Cells were then washed, fixed with 

25 formaldehyde 1% in PBS and analyzed on a FACS-Calibur flow cytometer, using CellQuest software 
(Becton Dickinson, San Jose, CA). 

EXAMPLE 8 

Identification and characterization of epitope containing peptides for direct enumeration of CEA-specific 
30 T cells 

To better characterize the immune response elicited upon genetic vaccination against 
CEA in mice, ELISPOT analysis was carried out on C57BL/6 and BALB/c mice to identify CD4+ and 
CD8+ CEA specific epitopes. To this end, different immunization modalities were compared to generate 
highly immunized mice that could be utilized to identify responses to individual peptides that cover the 
35 entire protein. In view of recent reports that indicate that high levels of cellular immunity can be induced 
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against viral and bacterial antigens by utilizing plasmid DNA prime-Ad boost modality, the same 
immurnzation protocol was employed in this study. Mice were immunized intramuscularly by different 
regimens: i) two doses of 1x109 vp of Ad/hCEA (Ad/Ad), ii) two doses of plasmid pVl J/hCEA 
(DNA/DNA) and iii) a dose of plasmid DNA followed by Ad/hCEA (DNA/Ad). The immunizations 
were two weeks apart. 

The cellular immunity elicited by the different immunization regimes was measured by 
ELISPOT assay 2 weeks after the boost. To compare the immunogenic efficiency of the different 
vaccination regimens, a pool of 15mer peptides overlapping by 1 1 aa and covering aa 497-703 (pool D) 
were used to stimulate antigen specific cytokine secretion from splenocytes. The most vigorous 
responses, indicated by the higher geometric mean values of the SFC, were observed in C57BL/6 and 
B ALB/c mice from the DNA/Ad injected group (FIGURE 4). Thus, this regimen was utilized to further 
analyze the immune response. 

To determine whether the immune response was equally distributed across the entire 
CEA protein, splenocytes from immunized C57BL76 and B ALB/c mice were stimulated in vitro with one 
of four pools of 15-mer peptides that collectively encompass the entire protein sequence. Each pool 
consisted of peptides 15 amino acids long that overlap by 11 residues. Lyophilized hCEA peptides were 
purchased from Bio-Synthesis (Lewisville, TX) and resuspended in DMSO at 40 mg/ml. In addition to 
pool D, pools A (aa 1 to 147), B (aa 137 to 237), and C (aa 317 to 507) were used in this study. Final 
concentrations were the following: pool A=1.2 mg/ml, pool B= 0.89 mg/ml, pool C= 0.89 mg/ml, pool 
D= 0.8 mg/ml. Peptides were stored at -80°C. 

The immune response elicited by the DNA/Ad vaccination regimen in C57BL/6 mice 
was primarily biased towards the C-terminal region of the protein (see FIGURE 5A). Significant SFC 
values were obtained with peptide pool C and D (geometric mean values: 170 and 244 SFC/106 
splenocytes, respectively), whereas pool A and B yielded much lower values (10 and 27 SFC/106 
splenocytes, respectively). In contrast, the immune response in B ALB/c mice was highest with pool B 
(geometric mean value: 1236 SFC/IO^ splenocytes), although pool A, C, and D showed significant SFC 
values (93, 263, and 344 respectively) (FIGURE 5B). No responses against a pool of unrelated peptides 
were noted in both groups of mice (data not shown). 

To identify the individual peptides present in the peptide pools that elicit the responses, 
spleens from 4 mice immunized with the DNA/Ad vaccination regimen were analyzed in a JFNy- 
ELISPOT assay against each of the individual peptides comprising the pools against which a significant 
immune response had been observed. Splenocytes from C57BL/6 mice were tested against peptides 80 to 
173 included in pool C and D. Splenocytes from BALB/c mice were tested against peptides 35 to 173 
that comprise pools B, C, and D. CEA specific responses in C57BL/6 mice were mapped to four pairs of 
15-mer peptides that had overlapping sequences (aa 431 to 435 and 425 to 439; 529 to 543, and 533 to 
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547; 565 to 579, and 569 to 593; 613 to 627 and 617 to 631) (FIGURE 6A). The immune response to 
CEA in BALB/c mice was mapped to 22 different peptides, 17 of which have overlapping sequences (aa 
213 to 227, and 213 and 227; 229 to 243, and 233 to 247; 409 to 423 and 413 to 427; 421 to 435 and 425 
to 439; 565 to 579 and 569 to 583; 573 to 587; 613 to 627 and 617 to 631, and 621 to 635 and 625 to 639; 
637 to 651 and 641 to 655) (FIGURE 6B). 

To define the T-cell specificity of the epitopes contained within the selected peptides, 
IFNy intracellular s taining assay was carried out on splenocytes from injected mice. The results obtained 
are shown in FIGURE 7. The data indicate that CD8+ and CD4+ specific epitopes have been identified 
for both C57BL/6 and BALB/c which can be used to quantify circulating levels of T-lymphocytes. 



EXAMPLE 9 

Codon optimized hCEA cDNA breaks tolerance in hCEA transgenic mice. 

To determine whether the enhanced immunogenic properties of the codon optimized 
cDNA of hCEA would break tolerance to human CEA more efficiently, hCEA transgenic mice were 

15 immunized with vectors carrying either the wild type or the codon optimized hCEA sequences. These 
transgenic mice carry the entire human CEA gene plus flanking sequences and express the hCEA protein 
in the cecum and colon. Thus, this mouse line is a useful model for studying the safety and efficacy of 
immunotherapy strategies directed against this tumor self antigen (Clarke et al. Mice transgenic for 
human CEA as a model for immunotherapy. Cancer Res. 58(7): 1469-77 (1998)). 

20 As a first test, groups of 5 to 10 transgenic mice were subjected to four electroinjections 

of 50 pig plasmid DNA followed by a final injection of 1x10*0 pp G f Adenovirus. The immune response 
to hCEA was analyzed by IFNy-ELISPOT assay on pooled splenocytes from 4 injected mice. The 
immune response to hCEA was detected only with the splenocytes from the mice immunized with the 
hCEAopt cDNA (see FIGURE 8). The immune response was detected with peptides 143 and pool D, 

25 suggesting that immunization had elicited a significant CD8+ response to the C^erminal epitopes. 

The enhanced immunogenicity of the codon optimized cDNA of hCEA was also tested in 
transgenic mice using two injections of IxlO 1 ^ pp of Adenovirus vectors two weeks apart The CEA- 
specific immune response was measured by EFNy intracellular staining on pooled PBMC from 4 
immunized mice. The immune response to hCEA was detected only in mice immunized with Ad/CEAopt 

30 (FIGURE 9). As observed with the DNA plus Ad cohort, induction of CD8+ T-cells was detected with 
peptide pool D; however, a significant CD8+ response was also noted with peptide pool A. Thus, these 
results indicate that the codon optimized cDNA of hCEA is more immunogenic and breaks tolerance to 
hCEA more efficiently than the wild type sequence. 

35 EXAMPLE 10 
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Antibodies Detection and Titration. 

Sera for antibody titration were obtained by retro-orbital bleeding. ELISA plates (Nunc 
maxisorp™) were coated with 100 ng/well with CEA protein (highly pure CEA; Fitzgerald Industries 
International Inc., Concord MA), diluted in coating buffer (50mM NaHCX)3, pH 9.4) and incubated O/N 

5 at 4°C. Plates were then blocked with PBS containing 5% BSA for 1 hr at 37°C Mouse sera were diluted 
in PBS 5% BSA (dilution 1/50 to evaluate seroconversion rate; dilutions from 1:10 to 1:31,2150 to 
evaluate titer). Pre-immune sera were used as background. Diluted sera were incubated O/N at 4°C 
Washes were carried out with PBS 1% BSA, 0.05% Tween 20. Secondary antibody (goat anti-mouse, 
IgG Peroxidase, Sigma) was diluted 1/2000 in PBS, 5% BSA and incubated 2-3 hr at room temp, on a 

10 shaker. After washing, plates were developed with 100 fil/well of TMB substrate (Pierce Biotechnology, 
Inc., Rockford, IL). The reaction was stopped with 25 [il/well of 1M H2SO4 solution and plates were 
read at 450 nm/620 nm. Anti-CEA serum titers were calculated as the reciprocal limiting dilution of 
serum producing an absorbance at least 3-fold greater than the absorbance of autologous pre-immune 
serum at the same dilution. 

15 

EXAMPLE 11 

Increased Immunogenicity of hCEAopt. 

To examine in vivo immune responses induced by the wild type and codon optimized 
CEA expression vectors, C57BL/6 mice were immunized intramuscularly with different doses of 

20 Ad5/hCEAopt ranging from 1x10^ to lxK)3 pfu. As comparison, groups of 8 to 10 mice were 

immunized with Ad5/hCEA in doses ranging from 1x10^ to 1x104 pfu. Mice were subjected to two 
injections three weeks apart. Two weeks after the second immunization, splenocytes were isolated from 
each mouse. To quantify the IFNy secreting CEA-specific CD8 T-cell precursor frequencies generated by 
the Adenovirus mediated immunization, the ELISPOT assay for the H-2*> restricted T-cell epitope 

25 CGIQNSVSA (SEQ ID NO: 14, see below) was used. Immunization with 1x104 pfu elicited a 

measurable immune response yielding 53 EFNy spot forming cells (SFC, geometric mean value) specific 
for the CGIQNSVSA epitope (SEQ ID NO: 14), whereas injection of 1x103 pfu elicited negligible SFC 
values (FIGURE 3A). The SFC increased to 302 in the group immunized with 1x105 pfu of 
Ad/hCEAopt. In contrast, at least lxl(>5 pfu of Ad5/hCEA were necessary to elicit a significant CD8 T- 

30 cell precursor frequencies that increased to 168 SFC in the mouse group immunized with a dose of 1x10^ 
pfu. No peptide-specific IFNy SFC were detected in the Ad5 immunized mice (data not shown). 

Sera from mice immunized with 1x10^ pfu of each hCEA Adenovirus vector were tested 
in ELISA using the purified human CEA protein as substrate (FIGURE 3B). CEA-specific antibody titer 
in Ad5/hCEAopt immunized mice was detected in all immunized mice and the geometric mean value of 

35 the Ab titer was 46,474. In contrast, the Ad5/hCEA immunized group showed an approximately 100 fold 
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lower geometric mean titer of CEA-specific antibody (454). Thus, these results demonstrate that the 
codon optimized cDNA of CEA is more efficient in eliciting an cellular and humoral immune response. 

EXAMPLE 12 

5 Statistical Analysis. 

Where indicated, results were analyzed by the Student t test. A p value < 0.05 was 
considered significant 
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WHAT IS CLAIMED IS: 

1. A synthetic nucleic acid molecule comprising a sequence of nucleotides that 
encodes a human carcinoembryonic antigen (CEA) protein as set forth in SEQ ID NO:2, the synthetic 
nucleic acid molecule being codonK>ptimized for high level expression in a human cell. 

2. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is DNA- 

3. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is 



mRNA. 



cDNA. 



4. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is 



5. The synthetic nucleic acid molecule of claim 1 wherein the sequence of 
nucleotides comprises the sequence of nucleotides set forth in SEQ ID NO: 1. 

6. A vector comprising the nucleic acid molecule of claim 1 . 

7. A host cell comprising the vector of claim 6. 

8. A process for expressing a human carcinoembryonic antigen (CEA) protein in a 

recombinant host cell, comprising: 

(a) introducing a vector comprising the nucleic acid of claim 1 into a suitable host 

cell; and, 

(b) culturing the host cell undo: conditions which allow expression of said human 

CEA protein. 

9. A method of preventing or treating cancer comprising administering to a 
mammal a vaccine vector comprising a synthetic cod\>n<>ptimized nucleic acid molecule, the nucleic acid 
molecule comprising a sequence of nucleotides that encodes a human carcinoembryonic antigen (hCEA) 
protein as set forth in SEQ ID NO:2 or a CEA protein variant as set forth in SEQ ID NO: 16. 

10. A method according to claim 9 wherein the mammal is human. 
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11. A method according to claim 9 wherein the vector is an adenovirus vector or a 

plasmid vector. 

12. A method according to claim 9 wherein the vector is an adenoviral vector 
5 comprising an adenoviral genome with a deletion in the adenovirus El region, and an insert in the 

adenovirus El region, wherein the insert comprises an expression cassette comprising: 

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant 

thereof; and 

(b) a promoter operably linked to the polynucleotide. 

10 

13. A method according to claim 9 wherein the vector is a plasmid vaccine vector, 
which comprises a plasmid portion and an expressible cassette comprising 

(a) a cc>don-optimized polynucleotide encoding a human CEA protein or variant 

thereof; and 

15 (b) a promoter operably linked to the polynucleotide. 

14. An adenovirus vaccine vector comprising an adenoviral genome with a deletion 
in the El region, and an insert in the El region, wherein the insert comprises an expression cassette 
comprising: 

20 (a) a codon-optimized polynucleotide encoding a human CEA protein or variant 

thereof; and 

(b) a promoter operably linked to the polynucleotide. 

15. An adenovirus vector according to claim 14 which is an Ad 5 vector. 

25 

16. An adenovirus vector according to claim 14 which is an Ad 6 vector. 

17. An adenovirus vector according to claim 14 which is an Ad 24 vector. 

30 18. A vaccine plasmid comprising a plasmid portion and an expression cassette 

portion, the expression cassette portion comprising: 

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant 

thereof; and 

(b) a promoter operably linked to the polynucleotide. 

35 
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19. A method of protecting a maTnmal from cancer comprising: 

(a) introducing into the mammal a first vector comprising: 

(i) a codon^ptimized polynucleotide encoding a human carcinoembryomc 
antigen (CEA) protein or variant thereof; and 
5 (ii) a promoter operably linked to the polynucleotide; 

(b) allowing a predetermined amount of time to pass; and 

(c) introducing into the mammal a second vector comprising: 

(i) a cc«ion-optimized polynucleotide encoding a human CEA protein or 

variant thereof; and 

10 (ii) a promoter operably linked to the polynucleotide. 

20. A method according to claim 19 wherein the first vector is a plasmid and the 
second vector is an adenovirus vector. 

15 21. A method according to claim 19 wherein the first vector is an adenovirus vector 

and the second vector is a plasmid. 

22. A method of treating a mammal suffering from a colorectal carcinoma 

comprising: 

20 (a) introducing into the mammal a first vector comprising: 

(i) a codon-optimized polynucleotide encoding a human CEA protein or 

variant thereof; and 

(ii) a promoter operably linked to the polynucleotide; 
(b) allowing a predetermined amount of time to pass; and 

25 (c) introducing into the mammal a second vector comprising: 

(i) a codon-optimized polynucleotide encoding a human CEA protein or 

variant thereof; and 

(ii) a promoter operably linked to the polynucleotide. 

30 23. A method according to claim 22 wherein the first vector is a plasmid and the 

second vector is an adenovirus vector. 

24. A method according to claim 22 wherein the first vector is an adenovirus vector 
and the second vector is a plasmid. 

35 
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25. A synthetic nucleic acid molecule comprising a sequence of nucleotides that 
encodes a human carcinoembryonic antigen (CEA) protein variant as set forth in SEQ ID NO: 16, the 
synthetic nucleic acid molecule being codon-optimized for high level expression in a human cell. 

26. The synthetic nucleic acid molecule of claim 25 wherein the sequence of 
nucleotides comprises the sequence of nucleotides set forth in SEQ ID NO: 15. . 



27. 



A vector comprising the nucleic acid molecule of claim 25. 



28. 



A host cell comprising the vector of claim 27. 
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FIGURE 1. Human Codon-Optimized CEA Sequence 

1 MetGluSerProSerAlaProProHisArgTrp^ 20 
1 ATGGAGTCTCCCTCGGCCCCTCCCCACAGATGGT^ 60 
AGC. . .AGC C :C.C .- C.C. .G G 

21 ThrAlaSerLeiiLeuIlirPheTrpAsnProProThrThrAlaLysLeuThrlleGluSer 40 
61 ACAGCCTCACTTCTAACCTTCTGGAACCCGCC 120 
. .C. . .AGC. .G. .G C C G. .C . .C . .GAG. 

41 ThrProPheAsnValAlaGluGlyliysGluValLeuLeiiLeuValHisAsiLLeuProGln 60 
121 ACGCCGTTCAATGTCGCAGAGGGGAAGGAGGTGCTTCTACTTGTC 180 
. .C. .C C..G. .C C G. .G. .G. .G C 

61 Hi sLeuPheGlyTyr S erTrpTyrLysGlyGluAr gValAspGlyAsnAr gGlnl 1 el 1 e 80 
181 CATCTTTTTGGCTACAGCTGGTACAAAGCT 240 
. .C. .G. .C G. -C. .GC.C C C. .G. .C. .C 

81 GlyTyrVallleGlyThrGlnGlnAlaThrProGlyProAlaTyrSerGlyArgGlu 100 
241 GGATATGTAATAGGAACTCAACAAGCTACCCCAGGGCCCGCATACAGTGGTCGAGAGATA 300 
. .C. .C. .G. .C. .C. .C. .G. .G. .C C. .C C C. .C. .C C 

101 HeTyrProAsnAlaSerLeuLeuIleGlnAsnllelleGlnAsnAspThrGlyPheTyr 120 
301 ATATACCCCAATGCATCCCTGCTGATCCAGAACATCATCCAGAATGACACAGGATTCTAC 360 
..C C..CAG C C..C 

121 ThrLeuHisVallleLysSerAspLeuValAsnGluGliiAlaThrGlyGlnPheArgVal 140 
361 ACCCTACACGTC^TAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGCCAGTTCCGGGTA 420 
G G. .C. . .AGC. .C. .G C. .G. .G. .C. .C C. .G 

141 TyrProGluLeuProLysProSerlleSerSerAsnAsnSerLysProValGliiAspLys 160 
421 TACCCGGAGCTGCCC^GCCCTCCATCTCCAGCAACAACTCC^^CCCGTGGAGGACAA^ 480 
C AG AG AG. . .G. . 

161 AspAlaValAlaPheThrCysGluPr oGluThrGlnAspAlaThrTyrLeuTrpTrpVal 180 
481 GATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACGCAACCTACCTGTGGTGGGTA 540 
. .C. .C C. .G. .C C C G 

181 AsnAsnGlnSerLeuProValSerProArglieuGlnlieuSer AsnGlyAsnArgThrLeu 200 
541 AACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTC 600 
C 6. .C. .6. .C. . .C.C AG. . .C C.C G 

201 ThrLeuPheAsnValThrArgAsnAspT^ 220 
601 ACTCTATTCAATGTCACAAGAAATGACACAGCAAGCTACAAATGTGAAACCCAGAACCCA 660 
. .C. .G C. .G. .CC.C. .C C. .C G. .C. .G .C 

221 ValSerAlaArgArgSerAspSerVallleLeiiAsnValLeuTyrGlyProAspAlaPro 240 
661 GTGAGTGCCAGGCGCAGTGATTCAGTCATCCTGAATGTCCTCTATGGCCCGGATGCCCCC 720 
C. . .C.C C. .CAGC. .G C. .G. .G. .C C. -C 

241 ThrlleSerProLeiiAsnThrSerTyrArgSerGlyGluAsnLeuAsiiLeuSerCysHi^ 260 
721 ACCATTTCCCCTCTAAA(^CATCTTACAGATC^GGGGAAAATCTGAACCTCTCCTGCCAC 780 
CAG. . .C. .G CAGC. . .C.CAGC. .C. .G. .C GAG 

261 AlaAlaSerAsnProProAlaGlnTyrSerTrpPheValAsnGlyThr PheGlnGlnSer 280 
781 GCAGCCTCTAACCCACCTGCACAGTACTCTTGGTTTGTCAATGGGACTTTCCAGCAATCC 840 
. .C. . .AGC C. .C. .C AGC C. .G. .C. .C. .C GAG. 

281 ThrGlnGluLeuPhel leProAsnlleThrValAsnAsnSerGlySerTyrThrCysGln 300 
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.841 ACCCAAGAGCTCTTTATCCCC^CATCACTGTGAATAATAGTGGATCCTATACGTGCCAA 900 
G G. .C C C .C .C .CAG. ..C.C G 

301 Al aHi s AsnSer AspThrGlyLeiaAsnArgThrThrValThrThr 1 1 eThrVa lTyrAl a 320 
901 GCCCATAACTCAGACACTGGCCTCAATAGGACCACAGTCACGACGATCACAGTCTA 960 
C. . .AGC C G. .CCC C. .G. .C. .C C.G. .C .C 

321 GluProProLysProPhelleThrSerAsnAsnSerAsnProValGlxiAspGlxiAspAla 340 
961 GAGCCACCCAAACCCTTCATCACCAGCAACAACTCCAACCCCGTGGAGGATGAGGATGCT 1020 
C G AG C C. .C 

341 VaLAlaLeuThrCysGluProGluIleGlnAsnThrThrTyrlieuTrpTrpValAsnAsn 360 
1021 GTAGC C TT AACC TGTGAAC CTGAGATTC AG AAC AC AAC C TAC C TGTGGTGGGTAAAT AAT 1080 
. .6. . .C.G C..G..C C C G..C..C 

3 61 GlnSerLeuProValSerProArgLeuGlnLeuSerAsnAspAsnArgThrLeuThrLeu 380 
1081 CAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGACAACAGGACCCTCACTCTA 1140 
G- .C .G. .C. . .C.C AG. . .C C.C G. .C. .G 

381 LeuSerValThrArgAsnAspValGlyProTyrGluCysGlylleGlnAsnGluLeuSer 400 
1141 CTCAGTGTCACAAGGAATGATGTAGGACCCTATGAGTGTGGAATCCAGAACGAATTAAGT 1200 
..G..C.G..CCC.C.C.G..C C C..C GCG. .C 

401 ValAspHisSerAspProVallleLeiaAsnValLeuTyrGlyProAspAspProThrlle 42 0 
1201 GTTGACCACAGCGACCCAGTCATCCTGAATGTCCTCTATGGCCCAGACGACCCCACCATT 1260 
. .G C. .G C. -G. .G. .C C C 

421 SerProSerTyrThrTyrTyrArgProGl^ 440 
1261 TCCCCCTCATACACCTATTACCGTCCAGGGGTGAACCTCAGCCTCTCCTGCCATGCAGCC 1320 
AG AGC C C .C .C G GAG C.C... 

441 SerAsnProProAlaGlnTyrSerTrpLeuI leAspGlyAsnlleGlnGlnHi sThrGln 460 
1321 TC TAACCC ACC TGC AC AGTATTCTTGGCTGATTGATGGGAAC ATCCAGC AACACACAC AA 1380 
AGC C. .C. .C CAGC C. -C^C G C.G 

461 GliiLeuPhelleSerAsnlleThrGluIiysAsnSerGlyLeuTVrThr^sGlnAlaAsn 480 
1381 GAGCTCTTTATCTCCAACATCACTGAGAAGAACAGCGGACTCTATACCTGCCAGGCCAAT 1440 
G. .C. . .AG C C. .G. .C C 

481 AsnSerAlaSerGlyHisSerArgThrThrValLysThrlleThrValSerAlaGluLeu 500 
1441 AACTCAGCCAGTGGCCACAGCAGGACTACAGTCAAGACAATCACAGTCTCTGCGGAGCTG 1500 
. . .AGC C C.C. .C. .C . -G C C. .GAGC. .C 

501 ProLysProSerlleSerSerAsnAsnSerLysProValGluAspLysAspAlaValAla 520 
1501 CCCAAGCCCTCCATCTCCAGCAACAACTCCAAACCCGTGGAGGACAAGGATGCTGTGGCC 1560 
AG AG AG. . .G C.C 

521 PheTho^sGluProGliiAlaGlnAsnThrThrTyrLeuTrpTrpVal^ 540 
1561 TTCACCTGTGAACCTGAGGCTCAGAACACAACCTACCTGTGGTGGGTAAATGGTCAGAGC 1620 
C .G. .C C C G. .C .C 

541 LeuProVal Ser ProArgLeuGlnLeuSerAsnGlyAsnArgThrLeuThrLeuPheAsn 560 
1621 CTCCCAGTCAGTCCCAGGCTGCAGCTGTCCAATGGCAAC^GGACCCTCACTCTATTCAAT 1680 
..G..C..6..C...C.C AG. . .C C.C G. .C ,G C 

561 ValThr ArgAsnAspAlaArgAlaTyrValCy sGlyl leGlnAsnSerVal SerAlaAsn 580 
1681 GTCACAAGAAATGACGCAAGAGCCTATGTATGTGGAATCCAGAACTCAGTGAGTGCAAAC 1740 
..6..CC.C..C CC.C C .G. .C .C AGC C.C... 
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581 ArgSerAspProValThrLeuAspValLeuTyrGlyProAspThrProIlelleSerPro 600 
1741 CGCA6TGACCCAC3TCACCCTGGATGTCCTCTATGGGCCGGA 1800 
C C. .G C. .G..G. .C. .C. .C CAG 

601 ProAspSerSerTyrLeuSerGlyAlaAsnL^^ 620 
1801 CCAGACTCGTCTTACCTTTCGGGAGCGAACCTCAACCTCTCCTC^ 1860 
. .C. . .AGCAGC GAGC. .C .C G GAG AGC. . .AGC. . . 

621 ProSerProGlnTyrSerTrpArglleAsnGlylleProGlnGliiHisThrGlixVallieu 640 
1861 CCATCCCCGCAGTATTCTTGGCGTATCAATGGGAT 1920 
. .CAG. . -C CAGC C C. .C..C..C G C. .G. .G. .G 

641 PhelleAlaLysIleTlxrProAsnAsnAsnGlyThrTyrAlaCysPheValSerAsn^ 660 
1921 TTTATCGCCAAAATCACGCCAAATAATAACGGGACCTATGCCTGTTTTGTCTCTAACTTG 1980 
. .C G C. .C. .C. .C C C C. .C. .GAGC. . .C. . 

661 AlaThrGlyArgAsnAsnSerlleValLysSerlleThrValSerAlaSerGlyThrSer 680 
1981 GCTACTGGCCGCAATAATTCCATAGTCAAGAGCATCACAGTCTCTGCATCTGGAACTTCT 2040 
. .C. .C C. .CAG. . .C. .G C. .GAGC. .CAGC. .C. .CAGC 

681 ProGlyLeuSerAlaGlyAlaThrValGlylleMetlleGlyValLeuValGlyValAla 700 
2041 CCTGGTCTCTCAGCTGGGGCCACTGTCGGCATCATGATTGGAGTGCTGGTTGGGGTTGCT 2100 
. .C. .C. .GAGC. .C. .C C. .G C. .C G. .C. .G. .C 



701 LeuIleEnd 703 
2101 CTGATATAG 2109 
C.GA 



(SEQ ID NO:2) 
(SEQ ID NO: 3) 
(SEQ ID NO:l) 
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FIGURE 3A. Cell-Mediated Immune Response to Human CEA 
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FIGURE 3B. Humoral Immune Response to Human CEA 
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FIGURE 4A. Comparison of Immunization Regimens Used for C57BL/6 Mice 
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FIGURE 4B. Comparison of Immunization Regimens Used for BALB/c Mice 
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FIGURE 5A. T-Cell Responses in C57BL/6 Mice 
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FIGURE 6A. Immunoresponsive Peptides of hCEA in Immunized C57BL/6 Mice 
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FIGURE 6B. Immunoresponsive Peptides of hCEA in Immunized BALB/c Mice 
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FIGURE 7. Sequence of Epitope-Containing Peptides for CEA 



A. C57BL/6 



Pool 


Peptide 


Position 


Sequence 


SEQIDNO. 


CD8 


CD4 


c 


CEA-107 


425-439 


TYYRPGVNLSLSCHA 


4 


0.02 


0.14 


D 


CEA-133 


539-543 NTTYLWWVNGQSLPV 


5 


0.01 


0.04 


D 


CEA-143 


569-583 


YVCGIQNSVSANRSD 


6 


0.64 


0.01 


D 


CEA-155 


617-631 


SASNPSPQYSWRING 


7 


0.03 


0.06 




DMSO 








0.01 


0.01 



B. BALB/c 



Pool 


Peptide 


Position Sequence 


SEQIDNO. 


CD8 


CD4 


B 


CEA-58 


229-243 VHJSTVLYGPDAPTIS 
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0.51 . 


0.01 
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CEA-98 
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421-435 SPSYTYYRPGVNLSL 
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CEA-156 
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CEA-167 
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12 


0.13 
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FIGURE 8. IFNy-ELISPOT analysis of immunized CEA.tg mice 3 
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FIGURE 9. IFNy -Intracellular Staining of Immunized CEA.tg Mice 
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FIGURE 10A. Nucleotide Sequence of Human CEA-AAD 

1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG 
CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG 
GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 
TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC 
CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC 
AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 
GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 
CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC 
AGCGGCGAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 
CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 
1001 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 
AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC 
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 
CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 
CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 
1301 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 
ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT 
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA 
GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 
CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 
AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG 
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC 
CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC GTGAACCTGA GCTGCCACAG 
CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC 
1901 AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 
GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
2001 CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGC (SEQ ID NO: 15) 
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FIGURE 10B. Amino Acid Sequence of Human CEA-AAD 

1 MESPSAPPHR WCIPWQRLLL TASIiLTFWNP PTTAKLTIES TPFNVAEGKE 

51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVJGTQQAT PGPAYSGREI 

101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS 

151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQI/SNGNRTL 

201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR 

251 SGENLHLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 

301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVKDEDA VALTCEPEIQ 

351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS 

401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL 

451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL 

501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLiWWVNGQS LPVSPRLQLS 

551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDYL YGPDTPIISP 

601 PDSSYLSGAN LNIiSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN 

651 GTYACFVSNL ATGRNNSIVK SITVSASG (SEQ ID NO: 16) 
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SEQUENCE LISTING 

<110> Istituto Di Ricerche Di Biologia Molecolar P. Angeletti S.P.A. 
Lamonica , Nicola 
Mennuni , Carmela 
Savino, Rocco 
Lahm, Armin 

<120> SYNTHETIC GENE ENCODING HUMAN 

CARCINOEMBRYONIC ANTIGEN AND USES THEREOF 



<130> ITR0044Y 

<150> 60/467,971 
<151> 2003-05-05 

<150> Unknown 
<151> 2004-02-11 

<160> 16 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2109 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> hCEAopt 
<400> 1 

atggagagcc ccagcgcccc cccccaccgc tggtgcatcc cctggcagcg cctgctgctg 60 
accgccagcc tgctgacctt ctggaacccc cccaccaccg ccaagctgac catcgagagc 120 
acccccttca acgtggccga gggcaaggag gtgctgctgc tggtgcacaa cctgccccag 180 
cacctgttcg gctacagctg gtacaagggc gagcgcgtgg acggcaaccg ccagatcatc 240 
ggctacgtga tcggcaccca gcaggccacc cccggccccg cctacagcgg ccgcgagatc 300 
atctacccca acgccagcct gctgatccag aacatcatcc agaacgacac cggcttctac 360 
accctgcacg tgatcaagag cgacctggtg aacgaggagg ccaccggcca gttccgcgtg 420 
taccccgagc tgcccaagcc cagcatcagc agcaacaaca gcaagcccgt ggaggacaag 480 
gacgccgtgg ccttcacctg cgagcccgag acccaggacg ccacctacct gtggtgggtg 540 
aacaaccaga gcctgcccgt gagcccccgc ctgcagctga gcaacggcaa ccgcaccctg 600 
accctgttca acgtgacccg caacgacacc gccagctaca agtgcgagac ccagaacccc 660 
gtgagcgccc gccgcagcga cagcgtgatc ctgaacgtgc tgtacggccc cgacgccccc 720 
accatcagcc ccctgaacac cagctaccgc agcggcgaga acctgaacct gagctgccac 780 
gccgccagca acccccccgc ccagtacagc tggttcgtga acggcacctt ccagcagagc 840 
acccaggagc tgttcatccc caacatcacc gtgaacaaca gcggcagcta cacctgccag 900 
gcccacaaca gcgacaccgg cctgaaccgc accaccgtga ccaccatcac cgtgtacgcc 960 
gagcccccca agcccttcat caccagcaac aacagcaacc ccgtggagga cgaggacgcc 1020 
gtggccctga cctgcgagcc cgagatccag aacaccacct acctgtggtg ggtgaacaac 1080 
cagagcctgc ccgtgagccc ccgcctgcag ctgagcaacg acaaccgcac cctgaccctg 1140 
ctgagcgtga cccgcaacga cgtgggcccc tacgagtgcg gcatccagaa cgagctgagc 1200 
gtggaccaca gcgaccccgt gatcctgaac gtgctgtacg gccccgacga ccccaccatc 1260 
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agccccagct acacctacta ccgccccggc gtgaacctga gcctgagctg ccacgccgcc 1320 

agcaaccccc ccgcccagta cagctggctg atcgacggca acatccagca gcacacccag 1380 

gagctgttca tcagcaacat caccgagaag aacagcggcc tgtacacctg ccaggccaac 1440 

aacagcgcca gcggccacag ccgcaccacc gtgaagacca tcaccgtgag cgccgagctg 1500 

cccaagccca gcatcagcag caacaacagc aagcccgtgg aggacaagga cgccgtggcc 1560 

ttcacctgcg agcccgaggc ccagaacacc acctacctgt ggtgggtgaa cggccagagc 1620 

ctgcccgtga gcccccgcct gcagctgagc aacggcaacc gcaccctgac cctgttcaac 1680 

gtgacccgca acgacgcccg cgcctacgtg tgcggcatcc agaacagcgt gagcgccaac 1740 

cgcagcgacc ccgtgaccct ggacgtgctg tacggccccg acacccccat catcagcccc 1800 

cccgacagca gctacctgag cggcgccaac ctgaacctga gctgccacag cgccagcaac 1860 

cccagccccc agtacagctg gcgcatcaac ggcatccccc agcagcacac ccaggtgctg 1920 

ttcatcgcca agatcacccc caacaacaac ggcacctacg cctgcttcgt gagcaacctg 1980 

gccaccggcc gcaacaacag catcgtgaag agcatcaccg tgagcgccag cggcaccagc 2040 

cccggcctga gcgccggcgc caccgtgggc atcatgatcg gcgtgctggt gggcgtggcc 2100 

ctgatctga 2109 

<210> 2 

<211> 702 

<212> PRT 

<213> Homo sapiens 



<400> 2 



Met 


Glu 


Ser 


Pro 


Ser 


Ala 


Pro 


Pro 


His 


Arg 


Trp 


Cys 


lie 


Pro 


Trp 


Gin 


1 








5 










10 










15 




Arg 


Leu 


Leu 


Leu 


Thr 


Ala 


Ser 


Leu 


Leu 


Thr 


Phe 


Trp 


Asn 


Pro 


Pro 


Thr 








20 










25 










30 






Thr 


Ala 


Lys 


Leu 


Thr 


He 


Glu 


Ser 


Thr 


Pro 


Phe 


Asn 


Val 


Ala 


Glu 


Gly 






35 










40 










45 








Lys 


Glu 


Val 


Leu 


Leu 


Leu 


Val 


His 


Asn 


Leu 


Pro 


Gin 


His 


Leu 


Phe 


Gly 




50 










55 










60 










Tyr 


Ser 


Trp 


Tyr 


Lys 


Gly 


Glu 


Arg 


Val 


Asp 


Gly 


Asn 


Arg 


Gin 


lie 


He 


65 










70 










75 










80 


Gly 


Tyr 


Val 


He 


Gly 


Thr 


Gin 


Gin 


Ala 


Thr 


Pro 


Gly 


Pro 


Ala 


Tyr 


Ser 










85 










90 










95 




Gly 


Arg 


Glu 


He 


He 


Tyr 


Pro 


Asn 


Ala 


Ser 


Leu 


Leu 


He 


Gin 


Asn 


He 








100 










105 










110 






He 


Gin 


Asn 


Asp 


Thr 


Gly 


Phe 


Tyr 


Thr 


Leu 


His 


Val 


He 


Lys 


Ser 


Asp 






115 










120 










125 








Leu 


Val 


Asn 


Glu 


Glu 


Ala 


Thr 


Gly 


Gin 


Phe 


Arg 


Val 


Tyr 


Pro 


Glu 


Leu 




130 










135 










140 










Pro 


Lys 


Pro 


Ser 


He 


Ser 


Ser 


Asn 


Asn 


Ser 


Lys 


Pro 


Val 


Glu 


Asp 


Lys 


145 










150 










155 










160 


Asp 


Ala 


Val 


Ala 


Phe 


Thr 


Cys 


Glu 


Pro 


Glu 


Thr 


Gin 


Asp 


Ala 


Thr 


Tyr 










165 










170 










175 




Leu 


Trp 


Trp 


Val 


Asn 


Asn 


Gin 


Ser 


Leu 


Pro 


Val 


Ser 


Pro 


Arg 


Leu 


Gin 








180 










185 










190 






Leu 


Ser 


Asn 


Gly 


Asn 


Arg 


Thr 


Leu 


Thr 


Leu 


Phe 


Asn 


Val 


Thr 


Arg 


Asn 






195 










200 










205 








Asp 


Thr 


Ala 


Ser 


Tyr 


Lys 


Cys 


Glu 


Thr 


Gin 


Asn 


Pro 


Val 


Ser 


Ala 


Arg 




210 










215 










220 










Arg 


Ser 


Asp 


Ser 


Val 


lie 


Leu 


Asn 


Val 


Leu 


Tyr 


Gly 


Pro 


Asp 


Ala 


Pro 


225 










230 










235 










240 


Thr 


He 


Ser 


Pro 


Leu 


Asn 


Thr 


Ser 


Tyr 


Arg 


Ser 


Gly 


Glu 


Asn 


Leu 


Asn 










245 










250 










255 




Leu 


Ser 


Cys 


His 


Ala 


Ala 


Ser 


Asn 


Pro 


Pro 


Ala 


Gin 


Tyr 


Ser 


Trp 


Phe 
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260 265 270 

Val Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu Phe He Pro Asn 

275 280 285 

He Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gin Ala His Asn Ser 

290 295 300 

Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr He Thr Val Tyr Ala 
305 310 315 320 

Glu Pro Pro Lys Pro Phe He Thr Ser Asn Asn Ser Asn Pro Val Glu 

325 330 335 

Asp Glu Asp Ala Val. Ala Leu Thr Cys Glu Pro Glu He Gin Asn Thr 

340 345 350 

Thr Tyr Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg 

355 360 365 

Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr 

370 375 380 

Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly He Gin Asn Glu Leu Ser 
385 390 395 400 

Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp 

405 410 415 

Asp Pro Thr He Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn 

420 425 430 

Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser 

435 440 445 

Trp Leu He Asp Gly Asn He Gin Gin His Thr Gin Glu Leu Phe He 

450 455 460 

Ser Asn He Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gin Ala Asn 
465 470 475 480 

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr He Thr Val 

485 490 495 

Ser Ala Glu Leu Pro Lys Pro Ser He Ser Ser Asn Asn Ser Lys Pro 

500 505 510 

Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gin 

515 520 525 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val Ser 

530 ~ 535 540 

Pro Arg Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
545 550 555 560 

Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly He Gin Asn Ser 

565 570 575 

Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly 

580 585 590 

Pro Asp Thr Pro He He Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly 

595 600 605 

Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gin 

610 615 620 

Tyr Ser Trp Arg He Asn Gly He Pro Gin Gin His Thr Gin Val Leu 
625 630 635 640 

Phe He Ala Lys He Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe 

645 650 655 

Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser He Val Lys Ser He 

660 665 670 

Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr 

675 680 685 

Val Gly He Met He Gly Val Leu Val Gly Val Ala Leu He 
690 695 700 



3/9 



WO 2004/099247 PCT7EP2004/004802 



<210> 3 

<211> 2109 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60 
acagcctcac ttctaacctt ctggaacccg cccaccacfcg ccaagctcac tattgaatcc 120 
acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 
catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 
ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 
atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 
accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 
tacccggagc tgcccaagcc ctccatctcc .agcaacaact ccaaacccgt ggaggacaag 480 
gatgctgtgg ccttcacctg tgaacctgag actcaggacg caacctacct gtggtgggta 540 
aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 600 
actctattca atgtcacaag aaatgacaca gcaagctaca aatgtgaaac ccagaaccca 660 
gtgagtgcca ggcgcagtga ttcagtcatc ctgaatgtcc tctatggccc ggatgccccc 720 
accatttccc ctctaaacac atcttacaga tcaggggaaa atctgaacct ctcctgccac 780 
gcagcctcta acccacctgc acagtactct tggtttgtca atgggacttt ccagcaatcc 840 
acccaagagc tctttatccc caacatcact gtgaataata gtggatccta tacgtgccaa 900 
gcccataact cagacactgg cctcaatagg accacagtca cgacgatcac agtctatgca 960 
gagccaccca aacccttcat caccagcaac aactccaacc ccgtggagga tgaggatgct 1020 
gtagccttaa cctgtgaacc tgagattcag aacacaacct acctgtggtg ggtaaataat 1080 
cagagcctcc cggtcagtcc caggctgcag ctgtccaatg acaacaggac cctcactcta 1140 
ctcagtgtca caaggaatga tgtaggaccc tatgagtgtg gaatccagaa cgaattaagt 1200 
gttgaccaca gcgacccagt catcctgaat gtcctctatg gcccagacga ccccaccatt 1260 
tccccctcat acacctatta ccgtccaggg gtgaacctca gcctctcctg ccatgcagcc 1320 
tctaacccac ctgcacagta ttcttggctg attgatggga acatccagca acacacacaa 1380 
gagctcttta tctccaacat cactgagaag aacagcggac tctatacctg ccaggccaat 1440 
aactcagcca gtggccacag caggactaca gtcaagacaa tcacagtctc tgcggagctg 1500 
cccaagccct ccatctccag caacaactcc aaacccgtgg aggacaagga tgctgtggcc 1560 
ttcacctgtg aacctgaggc tcagaacaca acctacctgt ggtgggtaaa tggtcagagc 1620 
ctcccagtca gtcccaggct gcagctgtcc aatggcaaca ggaccctcac tctattcaat 1680 
gtcacaagaa atgacgcaag agcctatgta tgtggaatcc agaactcagt gagtgcaaac 1740 
cgcagtgacc cagtcaccct ggatgtcctc tatgggccgg acacccccat catttccccc 1800 
ccagactcgt cttacctttc gggagcgaac ctcaacctct cctgccactc ggcctctaac 1860 
ccatccccgc agtattcttg gcgtatcaat gggataccgc agcaacacac acaagttctc 1920 
tttatcgcca aaatcacgcc aaataataac gggacctatg cctgttttgt ctctaacttg 1980 
gctactggcc gcaataattc catagtcaag agcatcacag tctctgcatc tggaacttct 2040 
cctggtctct cagctggggc cactgtcggc atcatgattg gagtgctggt tggggttgct 2100 
ctgatatag 2109 

<210> 4 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 4 

Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu Ser Cys His Ala 
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15 10 15 



<210> 5 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 5 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val 
15 10 15 



<210> 6 . 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 6 

Tyr Val Cys Gly lie Gin Asn Ser Val Ser Ala Asn Arg Ser Asp 
15 10 15 



<210> 7 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 7 

Ser Ala Ser Asn Pro Ser Pro Gin Tyr Ser Trp Arg lie Asn Gly 
1-5 10 15 



<210> 8 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 8 

Val He Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro Thr He Ser 
1 5 10 15 
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<210> 9 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 9 

Gly Pro Tyr Glu Cys Gly lie Gin Asn Glu Leu Ser Val Asp His 
15 10 15 



<210> 10 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 10 

Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu 
15 10 15 



<210> 11 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 11 

Pro Ser Pro Gin Tyr Ser Trp Arg lie Asn Gly lie Pro Gin Gin 
15 10 15 



<210> 12 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 12 

Asn Asn Ser lie Val Lys Ser He Thr Val Ser Ala Ser Gly Thr 
15 10 15 



<210> 13 
<211> 9 
<212> DMA 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Kozak sequence 

<400> 13 
gccgccacc 

<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> T-cell epitope 
<400> 14 

Cys Gly lie Gin Asn Ser Val Ser Ala 
1 5 



<210> 15 
<211> 2034 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> hCEA-AAD 
<400> 15 

atggagagcc ccagcgcccc cccccaccgc 
accgccagcc tgctgacctt ctggaacccc 
acccccttca acgtggccga gggcaaggag 
cacctgttcg gctacagctg gtacaagggc 
ggctacgtga tcggcaccca gcaggccacc 
atctacccca acgccagcct gctgatccag 
accctgcacg tgatcaagag cgacctggtg 
taccccgagc tgcccaagcc cagcatcagc 
gacgccgtgg ccttcacctg cgagcccgag 
aacaaccaga gcctgcccgt gagcccccgc 
accctgttca acgtgacccg caacgacacc 
gtgagcgccc gccgcagcga cagcgtgatc 
accatcagcc ccctgaacac cagctaccgc 
gccgccagca acccccccgc ccagtacagc 
acccaggagc tgttcatccc caacatcacc 
gcccacaaca gcgacaccgg cctgaaccgc 
gagcccccca agcccttcat caccagcaac 
gtggccctga cctgcgagcc cgagatccag 
cagagcctgc ccgtgagccc ccgcctgcag 
ctgagcgtga cccgcaacga cgtgggcccc 
gtggaccaca gcgaccccgt gatcctgaac 
agccccagct acacctacta ccgccccggc 
agcaaccccc ccgcccagta cagctggctg 
gagctgttca tcagcaacat caccgagaag 



tggtgcatcc cctggcagcg cctgctgctg 60 
cccaccaccg ccaagctgac catcgagagc 120 
gtgctgctgc tggtgcacaa cctgccccag 180 
gagcgcgtgg acggcaaccg ccagatcatc 240 
cccggccccg cctacagcgg ccgcgagatc 300 
aacatcatcc agaacgacac cggcttctac 360 
aacgaggagg ccaccggcca gttccgcgtg 420 
agcaacaaca gcaagcccgt ggaggacaag 480 
acccaggacg ccacctacct gtggtgggtg 540 
ctgcagctga gcaacggcaa ccgcaccctg 600 
gccagctaca agtgcgagac ccagaacccc 660 
ctgaacgtgc tgtacggccc cgacgccccc 720 
agcggcgaga acctgaacct gagctgccac 780 
tggttcgtga acggcacctt ccagcagagc 840 
gtgaacaaca gcggcagcta cacctgccag 900 
accaccgtga ccaccatcac cgtgtacgcc 960 
aacagcaacc ccgtggagga cgaggacgcc 1020 
aacaccacct acctgtggtg ggtgaacaac 1080 
ctgagcaacg acaaccgcac cctgaccctg 1140 
tacgagtgcg gcatccagaa cgagctgagc 1200 
gtgctgtacg gccccgacga ccccaccatc 1260 
gtgaacctga gcctgagctg ccacgccgcc 1320 
atcgacggca acatccagca gcacacccag 1380 
aacagcggcc tgtacacctg ccaggccaac 1440 
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aacagcgcca gcggccacag ccgcaccacc gtgaagacca tcaccgtgag cgccgagctg 1500 
cccaagccca gcatcagcag caacaacagc aagcccgtgg aggacaagga cgccgtggcc 1560 
ttcacctgcg agcccgaggc ccagaacacc acctacctgt ggtgggtgaa cggccagagc 1620 
ctgcccgtga gcccccgcct gcagctgagc aacggcaacc gcaccctgac cctgttcaac 1680 
gtgacccgca acgacgcccg cgcctacgtg tgcggcatcc agaacagcgt gagcgccaac 1740 
cgcagcgacc ccgtgaccct ggacgtgctg tacggccccg acacccccat catcagcccc 1800 
cccgacagca gctacctgag cggcgccaac ctgaacctga gctgccacag cgccagcaac 1860 
cccagccccc agtacagctg gcgcatcaac ggcatccccc agcagcacac ccaggtgctg 1920 
ttcatcgcca agatcacccc caacaacaac ggcacctacg cctgcttcgt gagcaacctg 1980 
gccaccggcc gcaacaacag catcgtgaag agcatcaccg tgagcgccag cggc 2034 

<210> 16 
<211> 678 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> hCEA-AAD 
<400> 16 

Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys lie Pro Trp Gin 

15 10 15 

Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr 

20 25 30 

Thr Ala Lys Leu Thr lie Glu Ser Thr Pro Phe Asn Val Ala Glu Gly 

35 40 45 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu Phe Gly 

50 55 60 

Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gin lie lie 
65 70 75 80 

Gly Tyr Val He Gly Thr Gin Gin Ala Thr Pro Gly Pro Ala Tyr Ser 

85 90 95 

Gly Arg Glu He He Tyr Pro Asn Ala Ser Leu Leu He Gin Asn He 

100 105 110 

He Gin Asn Asp Thr Gly Phe Tyr Thr Leu His Val He Lys Ser Asp 

115 120 125 

Leu Val Asn Glu Glu Ala Thr Gly Gin Phe Arg Val Tyr Pro Glu Leu 

130 135 140 

Pro Lys Pro Ser He Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys 
145 150 155 160 

Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gin Asp Ala Thr Tyr 

165 170 175 

Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg Leu Gin 

180 185 190 

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn 

195 200 205 

Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gin Asn Pro Val Ser Ala Arg 

210 215 220 

Arg Ser Asp Ser Val lie Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro 
225 230 235 " 240 

Thr lie Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn 

245 250 255 

Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser Trp Phe 

260 265 270 

Val Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu Phe He Pro Asn 
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lie 


Thr 


Val 


Asn 


Asn 


Ser 


Gly 


Ser 




290 










295 




Asp 


Thr 


Gly 


Leu 


Asn 


Arg 


Thr 


Thr 


305 










310 






Glu 


Pro 


Pro 


Lys 


Pro 


Phe 


He 


Thr 










325 








Asp 


Glu 


Asp 


Ala 


Val 


Ala 


Leu 


Thr 








340 










Thr 


Tyr 


Leu 


Trp 


Trp 


Val 


Asn 


Asn 






355 










360 


Leu 


Gin 


Leu 


Ser 


Asn 


Asp 


Asn 


Arg 




370 










375 




Arg 


Asn 


Asp 


Val 


Gly 


Pro 


Tyr 


Glu 


385 










390 






Val 


Asp 


His 


Ser 


Asp 


Pro 


Val 


He 










405 








Asp 


Pro 


Thr 


He 


Ser 


Pro 


Ser 


Tyr 








420 










Leu 


Ser 


Leu 


Ser 


Cys 


His 


Ala 


Ala 






435 










440 


Trp 


Leu 


He 


Asp 


Gly 


Asn 


lie 


Gin 




450 










455 




Ser 


Asn 


He 


Thr 


Glu 


Lys 


Asn 


Ser 


465 










470 






Asn 


Ser 


Ala 


Ser 


Gly 


His 


Ser 


Arg 










485 








Ser 


Ala 


Glu 


Leu 


Pro 


Lys 


Pro 


Ser 








500 










Val 


Glu 


Asp 


Lys 


Asp 


Ala 


Val 


Ala 






515 










520 


Asn 


Thr 


Thr 


Tyr 


Leu 


Trp 


Trp 


Val 




530 










535 




Pro 


Arg 


Leu 


Gin 


Leu 


Ser 


Asn 


Gly 


545 










550 






Val 


Thr 


Arg 


Asn 


Asp 


Ala 


Arg 


Ala 










565 








Val 


Ser 


Ala 


Asn 


Arg 


Ser 


Asp 


Pro 








580 










Pro 


Asp 


Thr 


Pro 


He 


He 


Ser 


Pro 






595 










600 


Ala 


Asn 


Leu 


Asn 


Leu 


Ser 


Cys 


His 




610 










615 




Tyr 


Ser 


Trp 


Arg 


He 


Asn 


Gly 


He 


625 










630 






Phe 


He 


Ala 


Lys 


He 


Thr 


Pro 


Asn 










645 








Val 


Ser 


Asn 


Leu 


Ala 


Thr 


Gly 


Arg 








660 










Thr 


Val 


Ser 


Ala 


Ser 


Gly 







675 



Tyr 


Thr Cys 


Gin Ala His 


Asn 


Ser 






300 








Val 


Thr Thr 


He 


Thr Val 


Tyr 


Ala 




315 








320 


Ser 


Asn Asn 


Ser 


Asn Pro 


Val 


Glu 




330 






335 




Cys 


Glu Pro 


Glu 


He Gin 


Asn 


Thr 


345 






350 






Gin 


Ser Leu 


Pro Val Ser 


Pro 


Arg 








365 






Thr 


Leu Thr 


Leu 


Leu Ser 


Val 


Thr 






380 








Cys 


Gly He 


Gin Asn Glu 


Leu 


Ser 




395 








400 


Leu 


Asn Val 


Leu Tyr Gly 


Pro 


Asp 




410 






415 




Thr 


Tyr Tyr 


Arg Pro Gly 


Val 


Asn 


425 






430 






Ser 


Asn Pro 


Pro Ala Gin 


Tyr 


Ser 








445 






Gin 


His Thr 


Gin 


Glu Leu 


Phe 


He 






460 








Gly 


Leu Tyr 


Thr 


Cys Gin 


Ala 


Asn 




475 








480 


Thr 


Thr Val 


Lys 


Thr lie 


Thr 


Val 




490 






495 




He 


Ser Ser 


Asn 


Asn Ser 


Lys 


Pro 


505 






510 






Phe 


Thr Cys 


Glu 


Pro Glu 


Ala 


Gin 








525 






Asn 


Gly Gin 


Ser 


Leu Pro 


Val 


Ser 






540 








Asn 


Arg Thr 


Leu 


Thr Leu 


Phe 


Asn 




555 








560 


Tyr 


Val Cys 


Gly He Gin 


Asn 


Ser 




570 






575 




Val 


Thr Leu 


Asp Val Leu 


Tyr 


Gly 


585 






590 






Pro 


Asp Ser 


Ser Tyr Leu 


Ser 


Gly 








605 






Ser 


Ala Ser 


Asn 


Pro Ser 


Pro 


Gin 






620 








Pro 


Gin Gin 


His 


Thr Gin 


Val 


Leu 




635 








640 


Asn 


Asn Gly 


Thr 


Tyr Ala 


Cys 


Phe 




650 






655 




Asn 


Asn Ser 


He 


Val Lys 


Ser 


He 


665 






670 
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